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PREFACE 


blems of psychology. To ask how much of what 
volutionary history and how much depends on 
wn lifetime is to ask what it can learn. Research 
ly simple procedures with animals to the com- 
blem-solving. Procedures in these areas are so 
that the temptation is great to restrict 
or and learning, or solely to issues 


Learning is central to the pro 

An Organism does depends on its e 
What it has experienced during its 0 
i) learning has ranged from relative 
ক) of human language and prob!‘ 
erent, and the literature so extensive, 


One’s attention solely to issues of animal behavi 1 lear L S 
of human learning and memory. Many texts in learning yield to this temptation. 


In tum, the study of learning has become more and more divided, with each 
and research methodologies. In this book, 


Pproach developing its own languages 
Part of my purpose has been to bring these literatures together, and to explore 
Some of the continuities between human leaming and the learning of other orga- 
isms. Humans are undoubtedly unique, but they share an evolutionary heritage 
With other species. The properties of animal learning are therefore likely to be 
relevant to earning in humans. Even if we only show that some kinds of human 
Caming are not reducible to types of learning known l occur with other 
Rn; we have at least begun to define = Peology of eaming from a 
Consistent behavioral point of view. outline the nature of a 
ntl Orientation here. That view 
» and is better treated in the context 


0 
Which wi £ I 
কং ill be taken up later in the text. 7 W 

Position one need not xed from consideration aspects of human behaving 


ometimes called mental, such as thinking and imagining. Thus, topics 3 
As the exclusive HIOVInGE of contemporary cognitive psychology will be treate 


0 ্‌ 3 
18 With those more traditionally regarded as behavior ns. Chapters 2 through 
In its overall structure, this book has two major Sections. p 


og over basic topics in animal behavior and learning, and el 10 ত i 
f 'Ver those in human learning and memory. In some places, these sectio 1 
arly independent. but more often the concepts developed in the first section are 


of specific psychological issues, some 
ill only note that to take a behavioral 


ix 
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prerequisites for the treatment of more complex issues in the second section. I 
have made a special effort to include examples of human behavior in discussing 
the relevance of animal learning studies, and to refer to appropriate concepts from 
animal behavior in discussing human learning and memory. The choices were to 
some extent dictated by the logic of the subject matter and by the availability of 
appropriate cases in the relevant research literature. 

To study learning, one should know what learning is. In chapter 1, therefore, 
this book begins with the problem of defining learning. In that context, this intro- 
ductory chapter also treats some aspects of technical vocabulary and of experi- 
mental method. Chapter 2 is organizational, providing an outline of the topics 
examined in detail in chapters 3 through 9. Chapter 3, on elicitation, covers the 
reflex and other relations generated when stimuli are presented to organisms. 
Other texts typically introduce conditioning at this point, but within the present 
Organization that topic is more effectively deferred until later. 

Chapters 4 and 5 show how the consequences of responding can affect 
behavior, in reinforcement and in aversive control. These topics raise the issue of 
classes of responses and classes of stimuli as behavioral units. They therefore lead 
to the concept of the operant, in chapter 6 on the differentiation of responses, and 
the concept of the discriminated Operant, in chapter 7 on stimulus control. Some 
implications of these concepts are illustrated in chapter 8, which examines the Ways 
in which behavior can be maintained over extended periods of time by reinforce- 
ment schedules. Chapter 9 then takes up the properties of conditioning and shows 
how they may be related to processes discussed in the preceding chapters. 

In its behavioral treatment of language, chapter 10 provides a bridge to the 
complexities of human learning and memory. Some features of behavioral and 
cognitive approaches are explicitly compared in chapter 11, on psycholinguistics. 
These two chapters set the stage for an examination of verbal learning and transfer, 
in chapter 12, and memory, in chapter 13. Cognition and problem-solving, in 
chapter 14, Present an opportunity for synthesis, because they bring together 


topics considered separately at various points throughout the text. The final 
chapter attempts an overv 


iew and integration of some major issues in the psychol- 
Ogy of learning. 


Throughout its history, 
theories. Particular learning theories were developed, elaborated, and then dis- 
placed by others. Many rem 


than when they were first i 
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much on the interpretation of particular findings as on the relations among the 
varied phenomena included within the psychology of learning. My intent has been 
to make the book useful not only to those who are already behaviorally inclined, 
but also to those who are decidedly not of the behavioral persuasion . 

This book has been an effort of some years. Its content grew over successive 
offerings of an introductory lecture course in the Psychology of Learning, first at 
the University College of Arts and Science of New York University and then at the 
University of Maryland Baltimore County. I am indebted to my students and 
colleagues at both campuses; I must especially mention Robert E. Silverman, T. 
James Matthews, Barry Schwartz, Kenneth Keller, Rona Yohalem, Jacob Gewirtz, 
and Eliot H. Shimoff. As my teachers and colleagues, numerous others contributed 
by their comments, discussions, and encouragement. I name only the following, 
mainly because I can still identify particular contributions of each: Leonard Cook, 
Willard F. Day, Lewis R. Golub, Eliot Hearst, Ralph F. Hefferline, Richard J. 
Herrnstein, Herbert M. Jenkins, Victor G. Laties, J. A. Nevin, Howard Rachlin, 
George S. Reynolds, B. F. Skinner, William C. Stebbins, S. S. Stevens, Stephanie 
B. Stolz, Herbert S. Terrace, and Thom Verhave. In such a listing, omissions are 
inevitable, but to a singular group of forty-odd unnamed others it will come as no 
revelation that at least six names remain deliberately secret. I wish also to express 
my appreciation for the invaluable help of Geraldine Hansen, Jill Himmer, and Lee 
Ratner, in preparation of the manuscript, and of Jack Burton, Neale Sweet, Ted 
Jursek, John Isley, Alison Gnerre, and Paul Spencer, in guiding the book through its 
Successive stages from conception to publication. J 

Above all, however, one further acknowledgment remains. I cannot say how 
much I owe to the teaching and the writing of Fred S. Keller and W. N. Schoenfeld. 
Their courses and their Principles of Psychology introduced me to the analysis of 
behavior. I hope that this book is true enough to their teaching that they will 


recognize something of themselves in its pages. 


A. Charles Catania 
Columbia, Maryland 
October 1978 


Introduction 


ONE 


The English word learning comes from an Indo-European root, leis-, which 
meant a track or furrow. Before it reached its present form, it went through 
many changes: laestan, leornian, lernen. At various times in the evolution of 
our language it might have been understood as following a track, continuing, 
coming to know, or perhaps even getting into a rut. From the same root came 


the verb last, to endure. 


cerned with how organisms come to behave in new 
y ways, and some ways may be more appro- 
priate as instances of learning than others. For example, two children each may 
have learned to answer multiplication questions. If one learned by studying the 
multiplication table and the other by mastering an electronic calculator, we would 
certainly agree that the children each learned something. We would also say that 
What each learned was different, and we might argue further that only the first 
child really learned to multiply. After all, that child could answer multiplication 
questions in a variety of circumstances, whereas the other child could do so only 
with a calculator. Thus, we could say what each child learned by describing what 
the child now can do and the circumstances in which the child can do it. 

This does not define learning, however. The task of definition is complicated, 
but the word learning does not give us trouble in everyday conversation. We can 
usually say whether or not we have learned something, and We are usually able to 
agree on what counts as learning even if we cannot give a definition. We may there- 
fore study learning by identifying particular cases, by observing the conditions 
under which they occur, and by examining how they can be combined. In our 
study of learning, we must remember to distinguish between two different 
Problems: first, what is the nature of the phenomena or events that we speak of as 
learning, and second, what is the best way of talking about them? 
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The study of learning is con 
Ways. But behavior can change in man 


THE LANGUAGE OF LEARNING 


First consider our language. We regard words like learning and knowledge as impor- 
tant and obvious. But often we fail to distinguish among the different functions of 
these words, and this can cause confusion. Sometimes we Speak of learning about 
something; at other times, we speak of learning how to do something. Someone 
who has learned how a bicycle works will not necessarily know how to ride a 
bicycle; inversely, someone who has learned to ride a bicycle may be unable to say 
what keeps a bicycle up, or even how the gears or the brakes of a bicycle work. 
We must decide whether we will treat these two kinds of learning together or 
Separately. Philosophers are concerned with this kind of distinction when they 
debate the difference between “knowing how” and “knowing that” (e.g., Ryle, 
1949). 

If learning could be defined in a sentence Or two, we would have no problem. 
We would define the word and then discuss the conditions under which learning 
takes place, the kinds of things that are learned, the limitations of learning, and so 
On. But learning has come to mean different things at different times and to 
different people, and it is not so easily defined. In fact, we might regard parts of 
this book as explorations of appropriate definitions for learning. What should we 
count as learning, and what should we exclude? 
Consider some examples. A pigeon discovers food in its travels, and returns 
to that place on later occasions When it is hungry. A child becomes able to read a 
story or to spell simple words. A dog is taught to sit or lie down on command. A 
patient who once had a bad experience in a dentist’s office consistently feels afraid 
in the waiting room. A Young cat, after its early hunting expeditions, comes to 
avoid skunks and Porcupines. A student reads a chapter in a mathematics textbook, 
and finds a way to solve a problem that once was baffling. A voter reads about the 
candidates for office in a newspaper, and votes on the basis of that reading several 
days later. 

What do these examples have in common? Whether they involve dogs and rats 
Or children and adults, each is an example of a change in what an organism does in 
particular circumstances. Learning, in fact, is often defined as a relatively 
permanent change in behavior resulting from experience (e.g., cf. Kimble, 1961, pp. 
1-13; K. Smith, 1969). But what do we mean by “relatively permanent,” by 
“behavior,” by “experience”? Staring at an eclipse of the sun is undoubtedly an 

s and it will certainly alter one’s future behavior if it permanently 


an instance of learning, we 


» however, whether there is any profit 
learns a route to food toge 


matics problem. 


ngle-celled Slipper-shaped micro-organism, 


anisms can learn is one of the lesser but 
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persistent controversies of the psychology of learning. In one experiment, a food- 
coated needle was repeatedly dipped into the center of a water dish that contained 
paramecia. After each feeding, the water was stirred, but nevertheless the paramecia 
began to congregate, after several feedings, in the region where food was presented. 
It was therefore argued that the paramecia had learned the location of food. The 
argument ended, however, when it was found that the stirrings did not adequately 
disperse the physical residues of past feedings. The congregation of the paramecia 
depended on these residues rather than on past opportunities to eat (Jensen, 1957). 
The experiment, therefore, had not demonstrated learning. 

In another experiment, paramecia were allowed to swim into a lighted area 
that had also been made hot. After this pairing of light with heat, the paramecia 
avoided other lighted areas even though those areas had not been heated. It was 
argued that, through the pairing, avoidance of heat had been conditioned to the 
light. But this argument, too, abated, when it was later discovered that heated 
Paramecia avoid light whether or not the light and the heat have been paired with 
each other (Best, 1954). The paramecia avoided the light because that is what 
heated paramecia do, and not because they had learned how light was related to 
heat in these experiments. This experiment, too, had failed to demonstrate learning. 
Thus, the answer to the question, whether paramecia learn, has changed from time 
to time. | 

But what kind of a question is this? Certainly the behavior ofa paramecium 
can change. Yet the same change in behavior might or might not be considered an 
instance of learning, depending on the circumstances that led up to it. The question 
is in part about paramecia, but it is also about the conditions under which we say 
that pr sess on se tga Ifa paramecium is exposed for a while toa 
temperature somewhat higher than the one at which it had been living, it may 
subsequently survive still higher temperatures that would otherwise have killed it 
(Beale, 1953). Might we not say that the paramecium had leamed to tolerate higher 
temperatures? And what could we discover about the way in which this tempera- 
ture adaptation works that would convince us to speak differently about it? There 
is no simple answer to these questions, but they at least illustrate that we must 
Worry not only about the behavior that we study but also about the way in which 


We speak of it. 


At this point, some might argue that these paramecium examples are irrele- 


Vant in any case. The paramecium is a single-celled organism, and therefore it can 
have no nervous system. It might be suggested that our Problems of definition 
would be resolved if we added that the change in behavior had to come about 
through some kind of change in the brain. But we do not look at an organism’s 
brain to decide whether it has learned something. We all have learned to say when 
We or others have learned something, but few if any of us have ever seen a brain 
doing anything. Even if we could watch a brain doing something, how would we 
know that what it was doing was learning? This is not to say that leaning, Has.no, 
Physiological basis; it would be interesting to know what physiological changes 
accompany learning. Yet if we cannot even say what learning is, we might find it 
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Properties of learning. 


The assertion that learning must take Place in the brain Carries with it many 
assumptions. If we study what an athlete learns, for example, shall we assume 


Physiological basis. 


The problems so far have only Partly been about the facts of learning; they 
have also been about our language. Languages are changeable, and their vocabularies 


not care whether the child came to be that Way because coo 
Or because anything else that the child did was punished. 


sive, trustworthy or unreliable. Describing people with words like artistic, athletic, 
Social, intellectual or musical specifies their preferred activities. This kind of 
vocabulary, however, is not suitable for discussing how Particular interests or traits 


It is interesting that human muscle cells, like human nerve cells, neither multiply nor 
regenerate in maturity; with exercise, individual muscle cells ETrOw in volume (Edgerton, 1973). 
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mple, there is an important difference between 
lying and telling the truth. But the habitual liar may lie for some of the same 
reasons that an honest person tells the truth. If circumstances are such that one 
child keeps out of trouble only by telling lies and another only by telling the truth, 
it would not surprise us if the first child grew up to be less truthful than the second. 
Yet the behavior of each child is shaped by its consequences; each child behaves so 
as to keep out of trouble. This shaping of behavior is our interest, but our everyday 
vocabulary does not equip us well for discussing it. 

Similar problems exist in fields other than psychology. When physicists look 
at events in the world, they do not find the everyday vocabulary adequate. They 
coin new terms or take over existing ones. The latter course can create difficulties. 
Words like work, force, and energy, for example, mean different things to physicists 
in their technical talk than they do to most other people in casual conversation. 
Fortunately for physicists, many of the phenomena that they study are far enough 
removed from our daily experience that we do not confuse their technical language 


with our ordinary discourse. 
This is not the case in psyc 


developed in an individual. For exal 


hology. We are all inescapably concerned with 


behavior. We speak of how people grow and change, we speculate about the reasons 
People have for doing things, and we ourselves learn new facts and acquire new 
Skills. If we wish new ways of talking about these events, we must take care that the 
new language does not get confused with the old one. We all have spent most of our 
lives talking in particular ways about what we do, and these familiar ways of talking 
may interfere with any new ways of talking that we may try to establish. Some 
parts of this volume will be concerned with establishing a behavioral language, and 
it will be important to note that the language will not be simply a paraphrase of 
everyday usage; it will instead require some new ways of dealing with familiar 


Phenomena or events. 


THE WORLD AND THE LABORATORY 


made even more difficult because we live in a 
influence what we do and how we change do not 


Occur in isolation. Thus, to understand any given situation, we must analyze it; we 
must strip away the unessential details to get at the basic processes that may be 


common to different circumstances. To analyze something is simply to break it 
For this reason, we turn to the laboratory. We 


down i i nent parts. 

begin ih Le Cnty BF লা simpler than ourselves, in simple environments. 

We must face the objection, of course, that a laboratory learning experiment 

is artificial and therefore inappropriate for establishing generalizations about 
t start with simplicity so that we can 


learn; i laboratory. But we mus 
arming outside of the ihe EO OEE 


develo iques and vocabularies with which 
he olen laboratory environment enables us to look at one thing at a 


time. We can arrange circumstances 50 that we know what goes into the 
experimental situation; if we are careful, we can exclude the distractions that might 
Otherwise obscure the processes that we wish to study. The simplicity of our 


The problems of language are ! 
complex world. The events that i 
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laboratory environment may also help us to see important aspects of learning, and 
therefore to develop a vocabulary appropriate to our Subject matter. We have to be 
able to identify events before we can study their properties. 

These advantages of the laboratory are illustrated in the following example 
about a pigeon that earns its food by pecking (cf. Kelleher and Gollub, 1962). The 
account is of interest because the outcome is not at all intuitively obvious. In itself, 
it is not a learning experiment, but it grew out of experiments on how organisms 
learn to order their behavior in time, and its methodologies have proved useful in 
other learning studies. Imagine a pigeon in an experimental chamber. The pigeon 
has been trained to earn food by pecking a key, a translucent disk recessed in the 
chamber wall (in recording pecks, this key operates much like a telegraph key). 
The key can be lit from behind to present visual stimuli to the pigeon. Currently, 
it is lit white. Our equipment is set so that every two-hundredth peck gives the 
Pigeon access to food in a hopper below the key. We reached this point gradually, 
first by letting every peck produce food, and then only every fifth peck, and then 
only every tenth, and so on, until finally the pigeon was pecking 200 times for 
each food delivery. 

In this situation, with no food but what is produced by pecking, the pigeon 
easily maintains an adequate daily ration. In fact, the pigeon eats its daily diet in as 
little as 4 or 5 minutes. If each food delivery lasts only 5 seconds (including the 
time to get from key to food hopper), the pigeon may be finished eating for the 
day after only 60 food deliveries. Even SO, this pigeon must peck at least twelve 
thousand times daily to earn this much food. 

Technically, the requirement of 200 pecks per food delivery is called a fixed- 
ratio schedule of reinforcement: fixed because the required number is constant 
from one food delivery to the next, and ratio because the schedule specifies the 
ratio of pecks to food deliveries (in this case, 200 to 1). The pecks occur because 
they produce food; we therefore say they are reinforced by food. But we do not 
have to reinforce every peck to keep the pigeon pecking, and the schedule simply 
specifies how we choose the particular peck that will produce food. Reinforcement 
schedules will be considered in detail in chapter 8, however, so let us return to the 
pigeon. 

Food has just been presented, the hopper has moved out of the pigeon’s 
reach, and the pigeon is standing before the white-lit key. Time passes, perhaps only 
15 or 20 seconds or perhaps several minutes. Eventually the pigeon approaches the 
key and begins to peck. Within a few pecks, the Pigeon is pecking as rapidly as 5 to 
10 pecks per second, and is not likely to pause at all as it completes its 200 pecks. 
The last of these pecks operates the food hopper. Given its rapid pecking, the 
Pigeon may peck 2 or 3 times more before it stops and moves down to the hopper 
to eat. When the food is withdrawn, time again passes before the pigeon returns to 
the key and begins its next 200 pecks. This is typical fixed 
Pause after reinforcement, followed b 
with the next reinforcement. 


“ratio performance: a 
Y rapid and uninterrupted pecking that ends 


The point is that this is easy to get the pigeon to do. Our Pigeon will earn 
enough food daily to keep itself alive and healthy for an indefinite time (its life 
expectancy is at least 15 years). If we kept it in its chamber 24 hours a day, we 
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would not have to intervene except to provide water, clean the chamber and keep 
the food hopper filled. 

But pigeons cannot count very well, and we might wonder whether the 
Pigeon is at a disadvantage each time it goes through its 200 pecks. Suppose we 
decide to help it to estimate how many pecks are left in the sequence, by present- 
ing different lights on the key depending on the number of pecks that have 
Occurred. Let us arrange that, after each food delivery, the key is blue until the 
Pigeon has pecked 50 times. The fiftieth peck turns the key green. After 50 more 
Pecks, the key turns yellow. Then, after 50 more, the key turns red. After 50 more 
pecks with the key now red, the 200 pecks are completed and the last peck 
Operates the food hopper. The pigeon eats, and the key is once again blue. The 
number of pecks required for food remains the same; only the lights on the key 
have changed. (The pigeon, by the way, can easily distinguish among these colors.) 

With no distinctive stimuli, the pigeon worked well enough, earning an 
adequate daily ration. Have we helped the pigeon by providing the colors to tell it 
Where it is in the sequence? The surprising effect of adding these stimuli is that the 
Pigeon begins to slow down. The pauses after each food delivery, when the key is 
blue, become longer. When the pigeon finally begins to respond, the pecks, which 
Used to occur in rapid succession, now occur sporadically. When the key turns 
green, after the fiftieth peck, the pigeon may even pause before starting the next 
S0-peck sequence. When the key had been always white, the Pigeon did not pause 
after SO pecks; the 200-peck sequence usually occurred without interruption. In 
Breen, another 50 pecks, and the key turns yellow. This time the pigeon is less 
likely to pause at all: 50 more pecks turn the key red, and the pigeon quickly com- 
Pletes the last 50 pecks, food is delivered and the pigeon eats. But then the key is 
blue again, and another long pause begins. 

The added colors did not help. The pigeon takes much longer to earn each 
food delivery than it had when the lamp remained white. In fact, even though this 
Pigeon had maintained an adequate diet at 200 pecks Per food delivery when the 
key was always white, we would now be wise to watch it carefully to be sure that 
its daily food intake does not decrease substantially. And this results not from 
Any change in the pecks required for food, but simply from a change in stimuli. 
When we added the colors, we broke down the 200 pecks into 4 distinct units of 
S50 pecks each. We call these chained fixed-ratio schedules: the separate stimuli 
COrrespond to the links of the chain. But the chain does not hold the 200-peck 
Sequence together more cohesively; instead, it breaks it up. When the key was 
always white, pecking at the start of the sequence was not so very different from 
Pecking near the end, when a peck finally produced food. Once the stimuli were 
added, however, pecking in the presence of the early stimuli became less like 
Pecking later. In blue, for example, pecks never produced food; at best they turned 
the key green, but pecks never produced food during this color either. With pecking 
teduced early in the chain, the time to complete each 200 pecks increased. In 
Chaineq schedules, a stimulus supports less responding the further it is from the end 
Of the sequence. Even severe deprivation of food is not sufficient to counteract this 


effect. 
There are other ways to alter the pigeon’s performance. For example, increas- 
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ing the required pecks, to 300 or even 500 per food delivery, also slows the pigeon 
down, creating long pauses after reinforcement and frequent interruptions of peck- 
ing; this effect is called ratio strain. We could now go on to examine the phenome- 
non in more detail, separating those properties of the situation that mattered from 
those that did not. For the Present purposes, however, the point is that the 
outcome is not obvious. It arises out of an experimental analysis of behavior and, 
because this effect occurs with other Organisms besides the pigeon, with different 
kinds of responses and different response requirements, and with different kinds 
and orderings of stimuli, it has substantial generality. Thus, we may wonder 
whether it is relevant to human behavior. 

The things people do, of course, depend on consequences more complex than 
food deliveries. Any account of human behavior must grapple with the fact that 
completing any sequential task, such as writing a paper, preparing a meal, or finish- 
ing a research project, is affected not only by its properties as a behavior sequence, 
but also by competing obligations and interests, delays between actions and 
outcomes, and any number of other factors. Nevertheless, so much of what we do 
involves sequential behavior that we must wonder whether we might sometimes 
Operate under the strain of too many links in our chains. The ability to formulate 
and achieve long-range Objectives is supposed to be a unique characteristic of the 
human species. But in the Pigeon’s behavior, adding even a Single link to a chain can 
have devastating effects. Perhaps we ought not assume that human behavior will 
inevitably be inmune to such effects. 

In any case, the example demonstrates that the laboratory may yield interest- 
ing phenomena even under relatively simple circumstances. The Pigeon’s chamber 
With its single key and its feeder may seem austere, and yet merely adding some 
colored lights produces effects with perhaps far-reaching implications. The 
controlled laboratory environment, therefore, reveals properties of behavior to us 


at the same time that it helps us to develop a language that is appropriate to this 
Subject matter. 


BEHAVIORAL AND COGNITIVE PSYCHOLOGIES 


» but it is behavior nonetheless. 


Ology, our terminology and ob- 
Servations and theories must ultimately be derived from behavior, from what 


On the other hand, there is more to an 
Two students may sit silently through a lect 
instructor that one is able to answer certain 


Organism than shows in its behavior. 
ure, and yet it may be clear to the 
questions and solve certain problems 
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while the other cannot. Although they might earlier have been distinguishable on 
the basis of past performances, the fact remains that they are not currently 
behaving differently. The difference is in what each potentially can do; we might 
say simply that one student knows more than the other. When we study this know- 


ledge, it is tempting to say that we study the student's mind. 
The debate between psychologists who call themselves behaviorists and 


Psychologists who call themselves mentalists 
One. To some extent, the debate has been about appropriate ways of talking of 
Psychological events. The behaviorist argues that, because behavior is all that is 
available to measure, the language of mental events misleads by calling attention 
away from the fundamental subject matter. The behaviorist is especially troubled 
When a mentalistic account is accepted as explanatory and therefore discourages 
further inquiry. For example, we may sometimes say that an idea, a feeling or a 
hunch led someone to do something. The behaviorist does not dispute the existence 
Of ideas, feelings and hunches, but rather criticizes their invocation as causes of 
behavior. It is too easy to be satisfied with an explanation in these terms; it is not 
enough to say that someone did something because of an idea, a feeling or a hunch. 
Ideas, feelings and hunches are about the world, and therefore must have their 
Origins in our experiences with the world. We must look further, to these past 
experiences or. in other words, to past behavior, to account for what we do. If we 
are SUCCES, We will at the same time have something to say about the origins 
Of our ideas, feelings and hunches. We will be able to say that these are names for 
Certain things that happen to us when we deal with events in the world. 

The cognitivist maintains that such a view 1s unnecessarily narrow. There 
must be processes that occur in our dealings with the world that are not observable 
in our behavior. When we try to recall a word that is ‘on the tip of our tongue” or 
try to solve a problem by “sleeping on it,” things are happening that do not show in 
Our behavior, and we may not even be able to report these things in our verbal 
behavior. If we can find out something about such processes, it cannot help but be 


relevan ing. 

es to ne Sociiean these two kinds of psychologists is simply 
a matter of language. The difficulties have persisted not because behaviorists and 
Cognitivists cannot understand each other, and not because there are psychological 
Problems that either can resolve that the other cannot, but rather because the two 
kinds of psychologists are interested in different types of TUEST ON: ie behaviorist 
tends to be interested in questions of function, and the cognitivist in questions of 
Structure (Catania, 19730). 

Suppose we are interested i 
May become concerned with wha 


or cognitivists has been a long-standing 


ing a child to read. On the one hand, we 
t we have to do to involve the child in reading. 
€ worry about what will keep the child alert, what ki make the child pay 
attention to the words presented, and what will help the child ether What the 
Various words are. Will we be more successful if we reward the child for being 
Correct or penalize the child for being wrong? When we arrange diffe vent 
Consequences for different answers the child might give, we are determining the 
Unctions of these various answers Of, more precisely, the functional relations 


n teach 
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between behavior and its consequences. Problems of function are concerned with 
how behavior and environmental events can act upon each other. 

On the other hand, no amount of concern with the effects of reward and 
punishment on the child’s mastery of reading will tell us the most efficient way to 
Present reading materials to the child. How is reading structured? What is the best 
way to order the materials? We might debate whether we should teach the child to 
read by starting with single letters, with syllables, or with whole words. When we 
arrange different orders of presentation of the materials to be learned, we are 
concerned with the effective structural relations within the subject matter. Are 
words unitary structures, or are they complex structures built up from simpler 
units, letters or syllables? Problems of structure are concerned with how behavior 
and the environment are organized. 

Both problems are important. Any attempt to affect how children learn to 
read will be deficient if it ignores either of them. But it is also important to distin- 
guish between them. There is little value in arguing over the priority of one or the 
other approach if both are indispensable to our understanding. Historically, 
difficulties arose because Psychologists interested in functional problems tended 
to speak a behavioral language, whereas those interested in Structural problems 
tended to speak a mental or cognitive language. But a behaviorist can study 
structural problems just as a mentalist can study functional ones. That these two 
different orientations, behaviorism and mentalism, tended to be correlated re- 
spectively with two different research emphases, functional and structural, was 
not inevitable. It is easy to see, however, how the correlation may have come about. 
If the experimental concern is functional, one studies the consequences of particu- 
lar relations between specified environmental events and specified actions; these 
relations are conveniently expressed in the behavioral language of stimuli and 
responses. If the experimental concern is Structural, one studies the properties of 
Particular capacities or abilities; these properties are conveniently expressed in the 
Cognitive language of knowledge and mind .2 

We need not be side-tracked by this controversy in Psychology. We will con- 
sider both functional and structural problems in learning and will therefore examine 
both types of research. This text is written from an explicitly behavioral Orientation, 
but in dealing with problems of structure it will necessarily extend this orientation 
to cognitive concepts. Although these two Psychological orientations differ in their 
languages and in the research problems that they emphasize, they have much in 
common: the reliance on experimental method; the anchoring of concepts to 


ons of an organ might determine 
en structure and function did not 
e ctions, such as that between mechanism and vitalism 
(Catania, 1978). 
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experimental manipulations and observations; and the assumption that our subject 
matter, however complex, is orderly and not capricious. 


IMPLICATIONS OF THE ANALYSIS OF BEHAVIOR 


To assert that behavior is orderly is simply to say that we believe behavior can be 
understood. If human actions were capricious, we could not expect to discover any 
Beneral principles by which to interpret these actions. Yet the assumption that our 
actions are caused or determined or predictable is sometimes regarded as incom- 
patible with any dignified view of human nature. We value freedom, but freedom 
seems inconsistent with a scientific view of behavior. How can this difficulty be 
resolved? 


We say that we are free when we are able to make choices. We choose to work 


Or to play, to spend time with one companion or another, to accept things as they 
are or try to change them. How are any of these choices made? If the choices are 
important, they are not left to chance; we say that we weigh the alternatives. In 
each instance, the potential consequences of engaging in one alternative or the 
Other influence what we choose. How can potential consequences enter into our 
decision, however, except by way of our earlier experience with other choices? It 
is in this sense that we may speak of our choices as determined. Would We wish to 
be free if by that we meant that our choices would be unaffected by their potential 
Consequences, or that we were indifferent to the possible outcomes? 

This point is sometimes misunderstood because we are often more concerned 
With another aspect of the concept of freedom. We are not so much worried about 
Whether our choices are determined as We dare about “how our choices are 
determined. We say we are less free when our choices are limited, and we are then 
Concerned with freedom and coercion. To the student, for SHED, there is an 
important difference between enrolling in a course because of one’s interest in the 
Subject matter and enrolling in a course because itis required; the choice to enroll 
May be as predictable or determined in one case as in the other, but we speak of 


the latter choi fi 
choice as forced, not free. et) $ Kt 
Once freedom has been defined in terms of the availability of choices, it even 


becomes possible to examine it experimentally. If an orn 188310. 10 De free 
Only when alternatives are available, it can further be said to value freedom only if 
it Prefers the availability of alternatives to their unavailability. Voss an Homzie 
(1970; see also Catania, 1975), for example, studied food-deprived rats’ preferences 
for two paths leading to food. One path consisted of a single predetermined route; 
the other allowed the rat to choose between two subpaths. Fourteen of fifteen 
rats in this experiment preferred the path that allowed a CS sf subpaths. 

hether this preference had phylogenetic origins or was somehow acquired during 
each rat’s lifetime, it can at least be concluded that the concept of freedom has a 
Place in the anialysis of behavior. And if a preference for freedom tus out to be 
Part of the phylogenetic endowment even of the rat, then corresponding human 
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preferences take on a special significance, because they need no longer be 
attributed merely to the practices of particular human cultures. 

We may prefer free choices, but we must also recognize that our choices are 
restricted by the nature of the physical world, by our own predispositions and 
capacities, and by the deliberate and the accidental actions of others. In our 
society, persuasion, legal sanction, threats, enticements, force, all are techniques 
that may be used to limit our choices. The techniques may be obvious or subtle, 
direct or devious, but they all enter into the potential consequences of our various 
alternatives and thereby influence our decisions. The issue becomes one not of 
whether our choices are determined, but rather of how our choices are 
manipulated, and by whom. One contribution of psychology may be simply to 
demonstrate the various ways in which our behavior can be influenced. If the 
determinants of human behavior can be identified, then these determinants must 
not be ignored; we cannot eliminate them simply by choosing not to study them. 
If our understanding of these determinants leads to the refinement of techniques 
to control human behavior, then this understanding must not be discouraged; we 
can best defend against the misuse of such techniques if we understand them. 

That human behavior is determined or at least predictable is not, of course, 
an empirical fact that can be confirmed or disproved. It is an assumption with 
which a scientific account often proceeds. This does not imply that we expect to 
be able to interpret any instance of behavior. There are limits to what we can know. 
It is tempting to ask a psychologist to explain why someone behaved in a particular 
Way, what led up to a particular incident, or how someone came to have particular 
interests, fears, or attachments. But the psychologist often has so little information 
available that only a plausible interpretation can be provided. This situation is not 
different except in degree from that in other sciences. Just as the principles of 
aerodynamics are not invalidated if we are unable to account for every twist and 
turn in the particular path of a particular falling leaf, the principles of behavior are 
not invalidated if we are unable to account for every detail of a particular 
organism’s performance on a particular occasion. We must not expect too much. 


OVERVIEW 


We turn now to the science of behavior. If the study of learning is about how 
behavior can be modified, we must first consider what behavior is, how it can be 
investigated, and what vocabulary might best describe it. The next chapter 
considers some basic properties of behavior, some procedures that have been used 
in the study of learning, and some problems created by our ways of talking about 
behavior. Later chapters examine specific procedures in more detail. They begin 
with the simplest procedure, presenting a stimulus, and then move through the 
arrangement of consequences and the nature of behavioral units to more complex 
procedures, including those involving the signalling effects of stimuli. The principles 
developed are then applied to an analysis of language, which provides a transition 
to a treatment of human verbal learning and memory. The concluding topics briefly 
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examine behavioral processes called cognitive, including problem solving and 
thinking. We will find throughout that the most profitable course is one that stays 
close to the data; we will worry less about psychological theory than about the 
Proper description of psychological findings. For example, it typically will be more 
useful to describe what an organism has learned or has remembered than to attempt 
to explain its learning or its memory. In our study of learning, it is important to 
recognize what remains out of our reach. 


Behovior 


TWO 


The word behavior, like habit, inhibit and ability, is related to the Latin 
habere, to hold, possess or have. The prefix be- became attached in such 
words as the Old and Middle English behabban and behaven, which meant 
to hold or bear oneself in some way. These words were closer to the sense of 
comportment or demeanor than to the more contemporary sense of activity, 
just as habit once referred more commonly to what was worn then to what 
was habitually done. A further aspect of this etymology or word history is 
that the Latin habere and the Latin capere, to take or seize, can be traced 
to closely related Indo-European roots. Capere leads to such psychologically 
Significant words as concept and perception; thus, these words and behavior 


are distant relatives. 


Behavior is no easier to define than learning. We may say glibly that behavior is 
Anything an organism does. We soon wonder, however, whether this definition is 


too global. Should we count respiration, the beating of the heart or metabolism 
ular secretions? We describe behavior 


along with muscular movements and gland 
But although we may say that 


With verbs: people walk, talk, think, do things. 
Someone breathes, we are not likely to say that someone heart-beats. We also 


like to distinguish between active and passive actions. People bleed when cut and 
fall when they lack support, but we may not wish to speak of their bleeding or 


their falling as behavior. 
Let us not try to resolve this pro j 
af behavior, and to consider how these properties can be described. The phenomena 


Of behavior are varied even though they sometimes share common names, so we 
Shall probably do better by considering examples than by attempting definitions. 
e can deal with specific examples without much risk of misunderstanding. 

When we observe an organism, all we see are properties of the organism’s 
environment and properties of its behavior. We call these properties stimuli and 
responses, but neither a stimulus nor a response is of interest by itself. We study a 
Stimulus to find out how it may affect responses, and we study a response to find 
Out how it may be affected by stimuli. An experimental analysis determines what 


blem. Our aim is to examine some properties 
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kinds of relations exist between stimuli and responses, and how these relations can 
come about. 


BEHAVIORAL HIERARCHIES 


Observing the organism is the simplest thing we can do. For example, if we were 
interested in the behavior of a rat, we might just watch it (cf. Bindra, 1961). At 
first the rat might remain motionless. Eventually it might begin to move about. We 
might notice it sniffing, or licking and grooming itself, or urinating and defecating. 
Some changes in its behavior might ordinarily be inaccessible to us; only by setting 
up the necessary recording equipment could we note changes in its respiration, 
heart rate or blood pressure. 

These and other responses could easily be inventoried. We could rank the 
various responses by how often they occurred, list the sequences in which they 
were ordered, and determine which could occur simultaneously and which only in 
Succession. But to find out more, we would have to provide an environment in 
which the rat could engage in other different kinds of behavior. 

Let us hypothetically construct such an environment by providing entrances 
into various compartments that give the rat access to different stimuli. Figure 2-1 
shows some possibilities: compartments containing a full food hopper, a drinking 
tube filled with water, an activity wheel, or nesting material; a small dark compart- 
ment and a large bright compartment; compartments in which the rat’s entry turns 
On a display of dim lights on the wall, quiet music, loud noises, a cold shower, or a 
blast of hot air; and, finally, a compartment with an electrified grid floor. 

Here again we might observe locomotion, licking or grooming, urination or 
defecation. But we would also begin to learn about the rat’s behavior with respect 
to these various stimuli. Presumably if the rat lived in this area for some time, it 
would begin to enter the food compartment occasionally and eat from the food 
hopper, or enter the water compartment and drink from the drinking tube, or enter 
the activity compartment and run in the activity wheel. After a few days, it might 
always sleep in the compartment with nesting material, or it might prefer the small 
dark compartment and perhaps would even carry some nesting material there. It 
probably would seldom stay for long in the large bright compartment. Eventually, 
it might spend brief periods looking or listening in the compartments with lights or 
music. But after only a few entries, it would probably rarely return to the compart- 
ments with loud noise, the cold shower, hot air, or the electrified grid. 

We must assume that the rat’s performance involves learning; the rat must 
learn where each compartment entrance leads. But our present concern is how the 
rat chooses to distribute its time among the compartments and, therefore, its 
tendency to engage in the various kinds of behavior that each compartment makes 
possible. Over any period of time, we could describe the rat’s behavior in terms of a 
hierarchy of responses, an ordering based on the relative frequencies with which the 


en responses occur. The organization of behavior in terms of the relative 
ikelihoods of different responses has been referred to as a habit-fami i h; 
(Ful, 1943) abit-family hierarchy 
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d relative frequency of visits to the various 
basis of such a record, the rat’s behavior can 


Figure 2.1. A hypothetical b 
Open area in the center, the order an 
Compartments can be recorded. On the 
be described in terms of a response hierarchy. 


ost active, the rat might go to the food 
then move on to the compartment with 
to the activity-wheel compartment and 


During the night, when rats are m 
Compartment and eat for a few minutes, 


the water and briefly drink, and finally go b ঠি 
run for a while. At ) we might say that eating is highest in the hierarchy, with 


drinking next and then running. Once some eating Hasraccurred; Crinking brie i 
Place at the top of the hierarchy. After some water has been consumed, drinking is 
displaced by running. In other words, the order of responses in the hierarchy 
Changes from time to time. In the daytime, the responses of eating and drinking 
and running all might be relatively low in the hierarchy; the rat might spend most 


Of its time sleeping in the small dark compartment. Y | ‘ 
At this BEDI one might argue that instead of the hierarchical ordering of the 


i 2s behavior in terms of its relati 
rat’s responses, we should discuss the rat’s b elative 
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preferences for different stimuli. We might then describe the rat’s movement from 
eating to drinking to running simply by noting that food was at first the most 
preferred stimulus and that later this preference shifted to water and to the running 
wheel. In this language of preference, however, we would still be dealing with 
exactly the same events. Thus, this example only illustrates how closely connected 
are our concepts of behavior and environment: we define environmental events in 
terms of their effects on behavior, and we define behavior in terms of its interaction 
with the environment. 

At any given time, of course, the rat can only be in one compartment. While 
the rat eats, for example, we can say little about the relative positions of listening 
to music or watching lights in this rat’s response hierarchy. But we could find out 
about them simply by closing all the compartment entrances except those two and 
observing how the rat distributes its time between music and lights. What we learn 
about the rat still depends on what we choose to measure. Describing the rat’s 
behavior in terms of response hierarchies will necessarily depend on what we make 
available to the rat in the compartments; some aspects of the rat’s behavior will 
inevitably be omitted. For example, the compartments in our hypothetical 
apparatus will tell us nothing about the rat’s social or sexual behavior. If we wished 
to study how this rat interacts with other rats, we would want to add more 
compartments, some containing a male rat, others containing a female rat, and 
still others containing different-sized groups of rats of one or both sexes. 

Nevertheless, we could begin to classify the rat’s responses according to the 
different probabilities with which the rat engages in them. The rat often enters the 
compartment where food sets the occasion for eating, whereas it rarely if ever 
enters the compartment where the electrified grid sets the occasion for jumping 
and squealing. The stimuli that generate these different responses have been called 
by different names; the former are sometimes referred to as appetitive or reinforc- 
ing stimuli, and the latter as aversive or punishing stimuli. The rat will produce the 
former by entering the appropriate compartment, but not the latter. Yet this 
classification is incomplete; the rat’s failure to enter certain compartments is not 
evidence enough that the stimuli in those compartments are aversive. Some stimuli 
may be effectively neutral. For example, the rat may infrequently enter the 
compartments with music or with dim lights simply because the rat is usually 
indifferent to such stimuli; listening to or looking at such stimuli may be relatively 
low in the response hierarchy. Thus, we must be concerned not only with the rat’s 
entry into different compartments, but also with its departures. 

At this point, simply watching the rat is not enough; we must intervene. To 
assess the effects of the stimuli in those compartments that the rat rarely enters, 
Wwe must place the rat in those compartments and measure how rapidly it leaves. 
We might find that the rat exits from the compartment with the electrified grid 
more rapidly than from those with cold showers Or hot air, that it exits from these 
more rapidly than from the large bright compartment or the one with loud noise, 
and that it exits from these in turn more rapidly than from the compartments with 
music Or dim lights. On the basis of these exits, we could order the stimuli of the 
Various compartments. The ordering would be relative, however, because it is not 
Obvious which compartment we should take as a baseline, or point of reference. 
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Even the open cet iGfirty RRs thié'rat may choose to spend some time. 
We must recognize that the rapidity of exit will depend not only on the stimuli in 
a4 compartment, but also on the availability of other compartments. For example, 
if all the other compartment entrances were closed off and the rat learned that it 
had to remain in the open central area, we might not expect it to leave the large 
bright compartment as quickly as it would if it learned that entry to all the other 
compartments were still available. 

From such observations, we must conclude that stimuli do not fall neatly into 
4 discrete classification. We cannot divide the environment into three simple classes 
Of events, such as appetitive, neutral, and aversive. Instead, each stimulus, and the 
responses it occasions, must be evaluated relative to the other available stimuli. 
At any given time, the environment provides a continuum or range of possibilities: 
the organism may be highly likely to expose itself to some stimuli, may be 
relatively indifferent to others, and may not only fail to expose itself to but may 
also quickly remove itself from still others. The ordering of events along this 
Continuum can change from one moment to another. Changes can result from 
earlier responding, as when eating varies as a function of the time since the last 
meal, or from changes in stimulus properties. If we are interested in the effects of 
music and vary its loudness, for example, we might find that the rat enters that 
Compartment at quiet levels, is indifferent at moderate levels, and avoids the com- 
partment at more intense levels. Thus, we would have to conclude that music is 
teinforcing, neutral or aversive, depending on its level. 

We cannot say what the effects of a given stimulus will be except by observ- 
ing behavior. We can call food appetitive or reinforcing only on the basis of 
behavior with respect to food; we can call dim lights neutral only on the basis of 
behavior with respect to dim lights; we can call electric shock aversive or punishing 
only on the basis of behavior with respect to electric shock. These examples all 
demonstrate that the characteristics of different stimuli cannot be specified 
independently of an organism’s behavior; but neither can the characteristics of 


Tesponses be specified independently of an organism’s environment. 


STIMULI AND RESPONSES 


muli and the rat’s responses in each 


This h i ith both the sti 
0 dealt with bo 
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Compartment. The close relation between the J j 
events was implicit. Let us now be more explicit about such relations, by turning 
to a different and more interesting organism, the human infant. | 
Again we begin simply by observing. We watch for a time, and we notice 
Movements of the hands or arms or legs. Perhaps at SDme. point the infant begins 
to Cry. If the crying stops without our intervention, the infant might sleep or might 
ie quietly with its eyes open. If we look closely, we might see the eyes moving, 
although it would be difficult to judge just what the infant was looking at, or 
Whether it was looking at all. We could begin to catalogue the various things that 
the infant did, and we might discover that particular movements usually Occur in 
Particular sequences. Just as with the rat, we would not learn very much if we only 
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watched; we could not say much more than that particular movements occurred 
more or less often and more or less in particular orders. j 

Again, we need not be restricted simply to watching. We can do things to the 
infant. We might touch it, rock it, move Objects before its eyes, make sounds, or 
Place a nipple in its mouth. We should not be Surprised if the infant responds to 
each of these events in a characteristic way. If the touch were to the palm of the 
hand, for example, the infant would probably clench its fist, grasping the object 
that touched it. The vocabulary for these events is already familiar: we call the 
touch to the palm a stimulus, and the grasping a response. A particular environ- 
mental event is followed by a particular instance of behavior. In this Case, we are 
not interested in the stimulus alone or in the response alone; we are interested in 
their relation to each other. We call this relation, the reliable production of a 
particular response by a particular stimulus, a reflex. 

We will consider the reflex in chapter 3. The important point here is that the 
term reflex is simply a name for a behavioral relation: an observed correlation 
between a particular stimulus and a particular response (Skinner, 1931). The 
concept is neither a theory nor an explanation. We observe a particular relation 
between environment and Organism, and we name it. If our naming has been effec- 
tive, we will simply know whether or not, with respect to other relations we might 
Observe, the same name is appropriate. 

Let us return to the infant. Besides the production of grasping by a touch to 
the palm, we might wish to catalog other examples of reflexes: the production of 
crying by a loud noise, or of sucking by a nipple in the mouth, or of blinking or 
pupillary contraction by a flash of light. These are not the only kinds of relations 
possible, however. When stimuli produce responses, the environment is acting on 
the infant; but the infant can also act on the environment. Its crying, for example, 
often brings its mother’s attention. Crying, then, is a response that can produce an 
environmental consequence: the presence of the mother. This behavioral relation 
involves stimuli and responses, but we cannot call it reflex because the stimuli 
and the responses do not occur in the appropriate order. 

The relations can become more complicated. If the infant moves its eyes 
While the lights are On, the eye movement may change what the infant sees. Eye 
movements cannot have this effect when the lights are off. Thus, the infant may 
come to look about in the light but not in the dark. In the presence of one 
stimulus, the light, moving the eyes may have consequences; it may produce 
another stimulus, some new thing seen. These Consequences cannot occur in the 
dark. Thus, the relation involves three terms: a prior stimulus, the light; a response, 
eye movement, in the presence of this stimulus 


newly seen as a Consequence of this response. 
prior-stimulus—response 


behavior depends both 


responses, however 
are these? 


A stimulus is an environmental event, but Such events have varying degrees of 
complexity. In our example, we regarded the infant’s mother as a stimulus. The 
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infant’s environment is certainly different when its mother is present than when 
she is absent. Yet what sort of stimulus is she? Early in its life, the infant cannot 
recognize her face or distinguish her from others. We do not know which aspects 
Of her looks, her voice or her touch are important to the infant. We might wonder 
Whether the infant would react to her any differently than usual if she approached 
Wearing a surgical mask. Nevertheless, despite our ignorance with respect to these 
questions, we do not doubt that the mother is an important feature of the infant’s 
environment. Lacking a more detailed account of the specific features of the 
mother to which the infant may respond, we may still find it useful to speak of the 
effects that she has as she comes and goes in the infant’s world. 

This example illustrates two different problems. If we try to analyze the 
Visual, auditory and tactile features of the mother that are important to the infant, 
We become concerned with the structure of this complex stimulus, the mother. We 
might ask how it is that the infant will be able to learn to respond to a particular 
individual as mother despite changes in her dress or hair style, her facial expression 
Or posture. But if we concentrate instead on how the mother interacts with the 
infant’s responses, we are concerned with the functional significance of the mother 
in the infant’s environment. For example, if an infant was crying, we would not 
have to know whether the infant recognized its mother by her face, her hair or her 
VOice to judge whether the mother’s presence made a difference in the crying; it 
Would be enough to note that when the mother went to the infant the crying 
stopped. Later, we will typically be interested in simpler stimuli: lights, sounds, 
food in the mouth. But even with simpler stimuli we will have to remember that 
Structural problems, in analyzing stimulus properties, are different from functional 
Problems, in analyzing the interaction between stimuli and SENOS iM 

And what about responses? How shall they be dealt with? In describing 


tesponses, we encounter at least two difficulties. The first is that behavior is not 
| the infant grasps an object on 
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Objects, without regard to the particular manner of performing these actions. 

Even in the absence of movement, we may sometimes conclude that behavior 
has occurred. For example, an infant will typically grasp an adult’s fingers 
sufficiently tightly that the infant can be lifted into the air. Once lifted, the infant 
may not move while holding on, and yet the very fact that the infant does not fall 
leads us to conclude that the grasping response continues. Similarly, if we see an 
adult standing still, our judgment that the adult is behaving stems partly from our 
knowing that the adult would fall down if unconscious or dead. It might be argued 
that the standing adult is in fact moving in small, unnoticed ways, but even if minor 
postural adjustments occur in standing still, we do not have to observe them to 
conclude that the adult is behaving. Thus, not all movements need to be instances 
of behavior, and not all instances of behavior need be movements. Whether 


behavior involves movement or not, it is important to examine its consequences. 


Thus, the critical feature of the infant’s grasping and the adult’s standing is simply 


that these responses have an effect; they determine whether or not the organism 
falls. 


OPERATIONS AND PROCESSES 


One more distinction remains before we consider some phenomena of learning: 
the distinction between operations and processes. We study the relation between 
events in the environment and the Organism’s responses by manipulating the 
environment and observing how this affects What the organism does. We operate 
on the organism’s environment Or, in other words, we perform experimental 
Operations. In the analysis of behavior, operations are what the experimenter does 
Or arranges, and processes are the changes in behavior that result.! Particular 
learning procedures can be described in terms of these operations, taken either 
alone or in some combination. 

The simplest Operation, of course, is merely to observe behavior. The 
behavior that we observe tells us what an Organism is capable of doing in such 
circumstances. But because we have no control over events when we only observe, 
we may be unable to draw conclusions about the causes of behavior. We therefore 
must intervene, and the simplest intervention is to present stimuli. When a stimulus 
Produces responses, we may speak of the Process of elicitation. A more complica- 
ted intervention is that of arranging the environment so that the organism’s 
behavior has consequences. Once responses have consequences, they may occur 
more or less often, and therefore consequential operations lead to the processes 
Sometimes called reinforcement and Dunishment. Finally, we can arrange that 
stimuli signal events, such as the presentation of other Stimuli or the availability of 
particular consequences. We then speak of stimulus-control operations; these 
Operations can only occur in combination with one of the simpler operations, 
presenting stimuli or arranging consequences. Behavior may then depend on 
whether the signalling stimulus is Present or absent. Thus, the basic operations that 
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Wwe will consider are: observing an organism's behavior, presenting stimuli, arrang- 
Ing consequences for responses, and signalling either stimuli or consequences. 


The observation of behavior 


What must we do to observe behavior? When we considered the rat, we 
Tecognized that its behavior would be interesting only if we gave it an interesting 
environment. Instead we might logically move in the other direction, avoiding any 
contamination of the rat’s behavior by environmental interventions. We could fit 
the rat with goggles to exclude visual stimuli and with ear plugs to exclude sounds 
and we could remove odors with a ventilating system. We might realize that the rat 
could still touch things, including parts of its own body, and so we might then 
Arrange a suit of hollow tubes that held the rat’s legs so that tactile contact was 
reduced for at least its paws. This might still not satisfy us, because the force of 
Bravity would produce pressures at those places where the suit met parts of the rat’s 
body and would allow spatial orientation. The next step would be to send the 
Suited rat up in a space shuttle, where gravity could. be eliminated. Yet even if we 
accomplished this much, what could we say about the rat’s behavior? What might 
We Observe the rat doing? 


The rat example is hypothetical, but experiments on sensory deprivation 


have placed humans in environments that approximate the conditions of minimal 


Stimulation suggested for the rat (e.8., P. Solomon, Kubzansky, Leiderman, Mendel- 
Son, Trumbull, and Wexler, 1961). The problem is that in environments like these, 
for human as well as for rat, there is not much to do; there is no place to go and no 
One to see or to speak to. Although humans in such environments report a range of 
Activities during their waking time, from thinking to hallucinating, it is not 
Surprising that they spend a fair proportion of time sleeping. 
yg So, we might conclude that we had been right in the first place. To Observe 
teresting behavior, we must place the organism in an interesting environment. 
§ an example, consider Kohler’s experiments on chimpanzee behavior (Kohler, 
1927), From 1913 to 1917, Kohler observed chimpanzees maintained at the 
nthropoid Station on Tenerife, an island northwest of Africa. In some of these 
eXperiments, bananas or oranges were placed in visible but inaccessible locations, but 
Materials within the chimpanzees’ chamber could be used as tools to gather in the 
uit. The following describes the behavior of the male chimpanzee, Sultan: 
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into the banana, only a few seconds had elapsed, a perfectly continuous 
action after the first hesitation (Kohler, 1927, pp. 39-40). 


In many instances, of course, chimpanzees made unsuccessful attempts to solve 
problems such as these; such attempts were aptly described as fruitless. 

Kohler was concerned with the ways in which chimpanzees could respond to 
the relations among objects and events in the environment, and he discussed these 
and related observations in terms of the chimpanzee’s intelligence and insight. 
His major contribution was in demonstrating what chimpanzees were capable of 
doing, and the readers of his time were impressed by his descriptions of chimpanzee 
performance. The problem was that it was not possible to say from observation 
alone where the behavior came from. Was Sultan able to solve a particular problem 
because of some inherited cognitive disposition, or because of features that the 
problem had in common with some situation that he had already encountered, or 
because of exposure to the behavior of other chimpanzees that he was able to 
imitate, or because of some combination of these and other factors? The term 
insight seemed appropriate because of the suddenness with which a solution to a 
problem often emerged, and problem solving that seemed insightful led to further 
questions: whether learning took place abruptly or gradually, and whether this 
type of problem solving could be explicitly taught. Debates about the instruct- 
ability of nonhuman primates continue to the present. But such issues cannot be 
resolved from observation alone. Observation by itself only rarely can indicate the 
Sources of behavior. 

Strictly, however, Kohler did more than simply observe behavior. He arranged 
environments within which to make his observations. Observation without interven- 
tion is difficult to achieve, because even bringing an organism into captivity is itself 
an intervention. Although he did not do so, Kohler instead could have searched out 
wild chimpanzees to observe in their natural habitats, or could have released some 
captive chimpanzees and attempted to track them in the wild. But the problem of 
keeping up with them as they moved on the ground or through the trees would 
have been compounded by the problem of also keeping hidden from them. To 
Observe successfully in the wild, one must know the possible effects of human sight 
and scent and sound on the organism under observation (e.g., van Lawick-Goodall, 
1968). To study these effects, one must present appropriate stimuli. Presenting 
stimuli, therefore, is a virtually inevitable component of any study of behavior. 


The presentation of stimuli 


We have seen that Kohler did in fact present stimuli, by arranging environ- 
ments for the chimpanzees he observed. Let us turn now to a different set of 


examples. The following, by the ethologist Tinbergen, describes the first feeding of 
the Herring Gull chick: 


Sometimes the parent stands up and looks down into the nest, and then we 
may see the first begging behavior of the young. They do not lose time in 
contemplating or studying the parent, whose head they see for the first time, 
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but begin to peck at its bill-tip right away, with repeated, quick, and 
relatively well-aimed darts of their tiny bills. They usually spread their wings 
and utter a faint squeaking sound. The old bird cannot resist this, and if only 
the chicks persist it will feed them. First the parent stretches its neck, and 
Soon a swelling appears at its base. It travels upward, causing the most 
appalling deformations and the most peculiar turnings and twistings of the 
neck. All at once the parent bends its head down and regurgitates an 
enormous lump of half-digested food. This is dropped, and a small piece is 
now picked up again and presented to the chicks. These redouble their 
efforts, and soon get hold of the food, whereupon the parent presents them 
with a new morsel. Now and then the chicks peck at the food on the ground, 
but more often they aim at the parent’s bill, and although this aiming is not 
always correct, it rarely takes them more than three or four attempts until 


they score a hit. (Tinbergen, 1960, p. 178) 


{ So far, we have here only some observations of chick behavior. But they 
involve the effects of stimuli, and therefore prompt some questions. What exactly 
are the critical features of these special stimuli presented by the parent gull? Are 
9 Are they the most effective ones possible? 
r these questions by preparing stimuli that 
He then compared the pecking generated 
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tip. A beak with a black or a blue or a white patch produced less pecking than one 
With a red patch, but a beak with a patch of any color produced more pecking 
than a beak without any patch at all. Compared to the red patch, the color of the 
beak and of the head were relatively unimportant in generating pecking. In fact, if 
the model did have a beak with a red patch on it, even the presence or absence of a 
ead made little di he 

পক TLL the shape of the beak, as illustrated in Figure 2-2. Next 

to each stimulus, pecking is shown as 2 percentage of the number of pecks 
Benerated by the normal beak shape, at the top (100%). The red patch and other 
Color differences were eliminated, so that changes in pecking could be attributed 
Solely to changes in shape. The interesting feature of varying shape is that, although 
most models produced less pecking than the model with the normal beak shape, 
€ model with an elongated beak (bottom) produced Et Sp pecking 
An any of the others, including the one with the normal beak shape. Because of 
S effectiveness relative to the normal shape, Tinbergen referred to this model as a 
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Figure 2-2. A series of models used in analyzing which properties of the parent 
gull’s beak produce begging pecks in the newly hatched Herring Gull chick. Pecks 
to each other model are expressed as a percentage of the reference level (100%) 


provided by pecks to the top model. (Adapted from Tinbergen and Perdeck, 1950, 
Figure 15.) 


Tinbergen was able to identify the critical structure of the stimuli that generated 
pecking in the newly hatched Herring Gull chick. 

Stimulus presentations are a common feature of research conducted by 
ethologists, whose concern is the evolution of species-specific behavior patterns and 
behavior in an organism’s natural habitats. One effect of stimulus presentations, as 
we have just seen, is the production of responses. This process, an outcome of 
presenting stimuli, is called elicitation; the stimulus is said to elicit a response. In 
the language of ethology, the critical stimuli or stimulus features are called releasers, 
and the behavior they produce is called a fixed action pattern. But variations in 
vocabulary should not obscure the simplicity of the basic operation of presenting 
stimuli. 

By presenting stimuli, we can determine what features are critical in generat- 
ing a particular instance of behavior, but we cannot necessarily deal with how this 
behavior changes over time. Data from the Laughing Gull chick provide an example. 
Feeding in the Laughing Gull chick differs in detail from that of the Herring Gull, 
but also includes a begging peck at the parent’s beak followed by the parent’s 
regurgitation of partly digested food that the chick then eats. The accuracy of the 
begging peck was tested by presenting beak models to chicks of various ages. Only 


about one-third of the pecks of newly hatched chicks struck the model, as opposed 
28 
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to more than three-quarters of the pecks of two-day-old chicks (Hailman, 1969). 
Was the improved accuracy a function of changes in coordination or in visual 
Xperience or in other factors? 

One possibility is that the consequences of accurately aimed pecks differ 
from those of poorly aimed pecks. In the Laughing Gull’s natural habitat, an 
accurately aimed peck is more likely than a poorly aimed peck to be followed by 
the parent’s regurgitation of food. Even without this outcome, an accurately aimed 
Peck will have a different consequence from a poorly aimed peck, because only the 
accurately aimed peck will contact the parent's beak. If contacting the parent’s 
beak and then feeding are important consequences for the Laughing Gull chick, we 
Might assume that responses producing these consequences will become more 
frequent than those that do not or, in other words, that accurately aimed pecks 
Will increase relative to poorly aimed pecks. Hailman’s observations in fact suggest 


that the chick’s behavior is determined by such consequences: 


to the target at first, its pecking thrust 
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To stuqy such effects, it is not enough simply to present stimuli. A more complex 

Operation must be arranged: Stimuli must be presented as consequences of the 
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Consequential operations 


oe ike? 
Again we move to a new set of examples, this time from Thorndike’s research 


On animal intelligence (Thorndike, 1898). The critical difference between Thorn- 
“Ike's research and Kohler’s was that Thorndike systematically observed changes 
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the time that he was in the box before performing the successful pull, Or 
clawing, or bite. ... If, on the other hand, after a certain time the animal did 
not succeed, he was taken out, but not fed. Thorndike, 1898, pp. 5-6) 


One of Thorndike’s puzzle-boxes is illustrated in Figure 2-3. In such 
apparatus, Thorndike studied cats, dogs, and chicks. The following description was 
offered as characteristic of the behavior of most cats: 


When put into the box the cat would show evident signs of discomfort and 
of an impulse to escape from confinement. It tries to squeeze through any 
Opening; it claws and bites at the bars or wire; it thrusts its paws out through 
any opening and claws at everything it reaches; it continues its efforts when it 
Strikes anything loose and Sshaky;it may claw at things within the box. . . .The 
cat that is clawing all over the box in her impulsive struggle will probably 
claw the string or loop or button so as to open the door. And gradually all the 
other non-successful impulses will be stamped out and the particular impulse 
leading to the successful act will be stamped in by the resulting pleasure, 
until, after many trials, the cat will, when put in the box, immediately claw 
the button or loop in a definite way. (Thorndike, 1898, p. 13) 
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consequence of what it did. The cat’s behavior changed as a result of this conse- 
quential operation, which made events depend on the cat’s behavior. Contemporary 
experiments, which often examine simpler responses in simpler situations, are 
Similarly concerned with the relation between responses and their consequences. 

We arrange consequences for responses by constructing environments in 
Which these responses can have specified effects. If we place food in the goalbox of 
a maze or a runway, for example, we create an environment in which a rat’s move- 
ment from the startbox to the goalbox will have food as a consequence. After the 
rat has reached the food on one occasion, we may examine how the food as a 
Consequence affects the rat’s behavior by observing the rat’s locomotion the next 
time we place it in the startbox. 

In consequential operations, varied consequences ! j 
stimuli of obvious biological significance such as food and water to relatively minor 
Changes in things seen or heard or touched. We need not even restrict ourselves to 
Consequences that involve the production of stimuli. Responses can remove stimuli, 
as when operating a switch turns off a light. Responses can prevent stimuli, as when 
unplugging a lamp before repairing it eliminates the possibility of a shock. 

sponses can even change the consequences of other responses, as when replacing 
2 burned-out light bulb makes the previously ineffective response of operating the 
light switch effective again. Each of these types of consequences can affect 
Subsequent behavior. Nevertheless, presenting stimuli and arranging consequences 
are not the only experimental operations in the analysis of behavior. 


can be arranged, from 


Stimulus-control operations 


The presentation of stimuli and the arrangement of consequences are seldom 
independent of other events in the organism's world. A flash of lightning typically 
Precedes a clap of thunder. A traffic light typically signals the possible 
Consequences of proceeding or stopping when one drives up to an intersection. 

hese two examples illustrate how the signalling or discriminative effects of stimuli 
may be combined with stimulus presentations or with consequential operations. 

he behavior of organisms does not occur indiscriminately; behavior that occurs in 
One cir ur in another. Er 

a enpalesst Seino and elicitation. The effects of ) that 
Signalled the presentation of other stimuli were the basis for Pavlov's studies of 
Conditional or conditioned reflexes (Pavlov, 1927). Pavlov S experiments were on 

e ways in which stimuli could acquire signalling properties. He examined responses 
to stimuli such as food, and then explored how these responses could be controlled 
Y other stimuli presented in some temporal relation to food. Pavlov spoke of the 
effects of food in a dog’s mouth in terms of the alimentary reflex, the components 
Of which included both the glandular response of salivating and motor responses 
Such as chewing and swallowing. He chose to concentrate on salivation in his studies 
cause the available technology made salivating PaSIE to measure than motor 
Tesponses: by surgery, the duct of one of the dog’s salivary glands was brought to 
the Outside of the dog’s cheek, where it was connected to a fluid system that 


Allowed drops of saliva produced by this gland to be counted. 
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For one dog, the presentation of food was consistently preceded by the 
sound of a metronome. Pavlov gave the following account of the conditions 
necessary to establish a stimulus as a signal: 


On several occasions this animal had been stimulated by the sound of the 
metronome and immediately presented with food—i.e., a stimulus which was 
neutral of itself had been Superimposed upon the action of the inborn 
alimentary reflex. We observed that, after several repetitions of the combined 
stimulation, the sounds from the metronome had acquired the property of 
stimulating salivary secretion and of evoking the motor reactions characteris- 
tic of the alimentary reflex... Hence a first and most essential requisite for 
the formation of a new conditioned reflex lies in a coincidence in time of the 
action of any previously neutral stimulus with some definite unconditioned 
stimulus. Further, it is not enough that there should be overlapping between 
the two stimuli; it is also and equally necessary that the conditioned stimulus 
should begin to operate before the unconditioned stimulus comes into action. 
If this order is reversed, the unconditioned stimulus being applied first and 


the neutral stimulus Second, the conditioned reflex cannot be established at 
all. (Pavlov, 1927, pp. 26-27) 


Pavlov’s conditioning experiments are effective demonstrations of how a stimulus- 
control or signalling operation can be superimposed on the simpler operation of 
stimulus presentation. 

Stimulus-control operations and consequences. Instead of signalling the 
presentation of stimuli, a stimulus may signal occasions on which responses will 
have particular Consequences: a stimulus-control operation can be superimposed on 
4 consequential operation. For example, an environment can be arranged in which a 
rat’s lever presses produce food when a light is on but not when the light is off (cf. 
Skinner, 1933, 19344). In this instance, the light signals the current consequence of 
pressing the lever, and we might expect the rat to come to press the lever more 
often when the light is on than When it is off; we may say that the lever press is 
reinforced in the Presence but not the absence of the light, and that the light is a 
discriminative stimulus. 

The relations between a discriminative stimulus and the consequences of 
responding are elaborated in the following passage by Skinner (the term operant 


refers to a class of responses having particular consequences, and the term reinforce- 
ment refers to these consequences): 


...the operant must operate upon nature to produce its reinforcement. Al 
though the response is free to come out in a very large number of stimulating 
producing a reinforcement only in a small 
part of them. The favorable situation is usually marked in some way, and the 
Organism. . .. comes to respond whenever a stimulus is present which has been 
present upon the occasion of a previous reinforce. 
otherwise. The prior stimulus does n 
occasion upon which the response will be reinforced 


therefore be Considered: a prior discriminative stimulus (SD), the response 
(RO), and the reinforcing stimulus (S! i 
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the stimulating environment. A certain movement of my arm (RO) is rein- 
forced by tactual stimulation from a pencil lying on my desk (SU). The move- 
ment is not always reinforced because the pencil is not always there. By 
virtue of the visual stimulation from the pencil (SD) I make the required 
movement only when it will be reinforced. The part played by the visual 
Stimulus is shown by considering the same case in a dark room. At one time 
I reach and touch a pencil, at another time I reach and do not... . In neither 
the light nor the dark does the pencil elicit my response (as a shock elicits 
flexion), but in the light it sets the occasion upon which a response will be 


reinforced. (Skinner, 1938, p. 178) 


Skinner explored this three-term relation experimentally with lights as 
discriminative stimuli, the lever presses of rats as responses, and food pellets as 
reinforcing consequences. His example of reaching for and touching objects in the 
Seen environment illustrates the broad range of situations to which the concept 
of stimulus control applies. The three-term relation, discriminative-stimulus— 
Tesponse_consequence, will be a recurrent theme. Each term is critical, because 
their combination distinguishes this case from other, simpler behavioral relations. 
In the Pavlovian situation, for example, in which a stimulus presentation is 
signalled, the organism’s behavior has no effect on the sequence of events; no 
consequences are arranged for responses. Consider the earlier examples. Our 
blinking Or startling at the lightning flash will not prevent the subsequent clap of 
thunder. But if a traffic light is red as we approach an intersection, our stepping on 
the brakes is occasioned by this stimulus only because we have learned the potential 
Consequences of doing or not doing So. Only the second of these two examples 
volves all of the terms of Skinner’s three-term contingency. An important 
difference in vocabulary accompanies these distinctions. When a stimulus is the 


brimap, cause 0 f a response, we say that the stimulus produces the response or that 
he repo is elicited. But when a response occurs In the presence of a stimulus 
ecause the stimulus signals some consequence of responding, we say that the 
Tiny, Occasions or evokes the response and that the response is emitted. 
The development of stimulus control over consequential responding has been 
called discrimination learning. Early animal experiments often were concerned not 
1 much with the nature of discrimination learning 2s ya ক 
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...it requires the animal to jump against the stimulus patterns from a 
distance, instead of to run past them. ...1] have usually trained the animals 
by placing the stand against the screen and allowing the animals to step 
through the open holes to the platform, then gradually withdrawing the stand 
until, in ten or fifteen trials, the distance of 25 cm. is reached. Cards are then 


placed in position and training in discrimination begun. (Lashley, 1930, pp. 
454-457). 


In Lashley’s apparatus, rats typically learned to discriminate black from white with 
perfect accuracy within four or five trials, and even more difficult discriminations 
such as vertical versus horizontal could usually be mastered within less than fifty 
trials. 

These are still cases in which discriminative stimuli signal consequences of 
responding, but they are more complex than the example in which a rat’s lever 
presses produced food in the presence but not in the absence of light. There, our 
concern was only with how often lever presses occurred when the light was on and 
when it was off. Consider, however, the jumping-stand. We may regard it as 
involving only two responses and their respective consequences: jumping toward 
vertical and finding food, and jumping toward horizontal and landing in the net. 
But assuming that the left and right sides are irrelevant to the rat is only our 
prejudice. Perhaps we should treat it as involving four responses, each with a 


Figure 2-4. The Lashley jumping-stand (Lashley, 1930, Figure 1). A rat was trained 
to jump from the stand (S) to one of the two doors (L and R). If the rat jumped 


I correct door, the door Eave way and the rat reached the food platform (FP). 
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being trained to iump vertical lines. iS TRE ee Le 
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particular consequence: jumping to vertical on the left, to vertical on the right, 
to horizontal on the left, and to horizontal on the right. Rats may be more likely 
to respond consistently on the basis of position, left or right, than on the basis of 
Stimulus cards. For example, if the first four trials of vertical-horizontal training 
Were set up with vertical on the right, as in Figure 2-4, we should not be surprised 
if on the fifth trial, with vertical on the left for the first time, the rat jumped right, 
toward horizontal. Until the fifth trial, jumping to the right was as effective in 
leading to food as jumping toward vertical. 

The kind of discrimination in which a single stimulus is present or absent, as 
in the lever-pressing example, is called successive discrimination. That in which two 
Or more stimuli are present at the same time and in which each is correlated with a 
different response, as in the jumping-stand example, is called simultaneous discrim- 
ination. Both illustrate stimulus-control operations superimposed on consequences 
of responding. The comparison between successive and simultaneous discrimina- 
tions shows that stimulus-control operations come in varying degrees of complexity. 
In fact, an important feature of many experiments on behavior is the analysis of 


complex situations into their component operations. 


RECAPITULATION 


The study of behavior is concerned with relations between environmental events, 
stimuli, and the organism’s actions, responses. These relations can be examined 
by analyzing how manipulations of the environment, experimental operations, 
Produce changes in responding, behavioral processes. A critical first step is 
Observation of behavior. Because we always observe behavior in some environment, 
it is typically not enough simply to watch an organism. To I 2 
We must intervene by changing the environment. We can Ie e environmenta 
Changes in terms of three classes of experimental operations: eo 
Operations, consequential operations, and stimulus-control operations. Table - 


Summarizes these operations. 


Presenting stimuli to an organism is the simplest operation. When we present 


Stimuli, we can observe the responses produced. The major effect of the stimulus- 
Presentation operation is to make particular IESPOns6s more or less SL For 
example, we might make a rat salivate by presenting food, but the rat রি stop 
Salivating and instead jump and squeal if we presented electric shoc - Stimulus 
Presentations are relevant to learning because the TESPONSES produced by stimuli can 
Vary depending upon the conditions under which stimuli are RE 

The things that happen in an organism's environment do not a ways happen 
by themselves. Sometimes the organism makes them happen; BCA can have 
Consequences. Environments can be arranged in which the organism's responses 
have effects, Arranging the environment so that an organism s responses produce or 
temove or prevent stimuli is called a consequential operation. For example, we 
Could present food to a rat whenever it reared up on its hind legs, or electric shock 
Whenever it moved to a certain floor area. As a result of consequential operations, 
Some responses may occur more often and others less often. If an organism 


‘sn[nwns Jo aouasaid 
ur paiiiuia aq Aew 
asuodsa1 ‘asuodso1 
2Y0d2 IO u01snI90 

Aew sn[nwns 


‘sn[nuwns 

Aq pailioija aq Aew 
asuodso1 ‘ asuodsor 

€ 1191/2 Aew sn(nwns 


‘pallu 
aq Aew asuodsoy 


‘sn[nums 

Aq palioija aq Aew 
ISUOdSs21 ‘asuodsaI 
119172 Aew snnwns 


"(H) uolDsiaauo2 

40] (nunjuoddo 
opiAoid Aew (5) 
SuluamsuD Yeu} sSfeu3Is 
(4) 2u0oydajal 3ui3ury 
‘(H) 1294913 

91Yf041 40 1uap1929D 

01 peo[ Aew (5) 
U011925421U1 YSNOAY1 
3u1dl14p Yeu] STeu3Is 
(4) 14811 24f041 pay 


(J) apunys 
saopavaid (@) 3utu1ysT 


‘(g) UMOAYT 51 YINIMS 
UUM (9) Ino sa08 1YIHT 
(9) wing soonpoid 

(g) ado1s 104 SuryImoL 
(9) Yuup UYos 

saonpoid (g) 2u1y9Dw 
Suipuad ur utog Suning 


“Wexo SuLinp 

aA2 s.quoned oyur 

(P) 148!1 saurlus 10790 
‘PIruo 

sofiieas (¥) 2as1ou pnoT 


"H aduanbasuon 
ABU [TIA 5 asuodsay 
Jeu] sfeu3is J snmwns 


“J snnuns 
Jo uonequasoid 
sfeu3Is G sn nuns 


‘(pojeuTwIo] ST 

10 paonpoid st sn[nwns 
B “3:9) 2 aouanbasuon 
sey g asuodsoy 


paluasaid st py snmnwns 


saguanbasuoo 
uo pasodwriodns 


uonejuasaid snnwns 
uo pasodwuodng 


uonteiado 


Ionuoov-snnwns 


uonreiado fenuanbasuon 


uonreiado 
uonejuasaid-snmwns 


250sN 


Sajduinx 


uonudu2sag 


uon1ado 


‘suonteIodO [eioraeuyog Seg 


FTaAIgVvL 


36 


Behavior 37 


responds more often because its response changes its environment, we say that the 
response was rewarded or reinforced; if an organism responds less often for the 
Same reason, we say that the response was suppressed or punished. The effects of 
Consequential operations, the ways in which they change behavior, are critically 
important; we must deal with these operations when we consider how organisms 
can alter their world. 

We can complicate matters by superimposing still another operation on 
Stimulus-presentation operations or on consequential operations. We can arrange 
that these simpler operations are in effect only when some signal is present. 
Organisms do not behave indiscriminately. They do some things in some circum- 
stances and other things in other circumstances. These relations are studied by 
arranging stimulus-control operations: one stimulus is used to warn of the imminent 
Occurrence of another, or to signal the conditions under which a response may have 
Some consequence. For example, we might present food to a rat only when a 
buzzer sounds, or electric shock only when a light is on. In these instances, a 
Stimulus-control operation is superimposed upon a stimulus-presentation operation: 
the buzzer signals food presentations and the light warns of shock presentations. 
Stimulus-control operations can also be arranged in conjunction with RE 
Operations. For example, a rat’s rearing up on its hind legs might 0 9 ly 
if a lamp is lit, or its movement to a certain area of the floor might produce e ectric 
Shock Only if a tone is sounding. In these instances, a eR BT Ss 
Superimposed upon a consequential operation: the lamp Ne 2 
Tesponse will produce food and the tone indicates when a nn a 
Produce shock. These stimulus-control operations may lead a c oe in Ee 
Produced by stimulus-presentation operations or Si LEU oe 
Only in the presence of the corresponding stimuli. রী AS ছি a 
Organism that some event is about to occur or that a a Et ER 
Consequences, the organism may come to respond HIeren ed UD 
is present than when it is absent. This behavie Ponto! of the discriminative 
and the organism’s behavior is said to be under the control 0 
Stim h Hl 

RE is complicated because different stimuli can have different effects 
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When a stimulus elicits or reliably produces a response, this stimulus-response 
relation is called a reflex. The Indo-European root steig-, to stick, is an 
ancestor of stimulus. The same root also generated distinguish, instinct, and, 
via stylus, a writing instrument, style. Through the Old French estiquet, to 
impale and later to label, it also produced a modern French word now related 
to behavior, etiquette. The Indo-European root spend-, to pour a libation or 
to make a treaty, led to the Greek sponde, a drink-offering, and the Latin 
spondere, to promise. Through these words, response, originally an under- 
taking in return, is linked to spouse, sponsor, and perhaps even to 
spontaneous. The latter relation is interesting because response now refers to 
a unit of behavior that need not be produced by a stimulus; responses may be 
elicited by stimuli but they may also occur spontaneously, when they are said 


to be emitted. 


ing stimuli. First we will examine some 
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reaction. This reliable relation between an environmental event, a stimulus, and a 
resulting change in behavior, a response, has been called a reflex. The vocabulary of 
the reflex has a long history (Fearing, 1930), travelling through a complex route 
from the philosopher Descartes to its contemporary usage. Descartes, a seventeenth- 
century philosopher, was familiar with hydraulic devices constructed to amuse 
visitors in the royal gardens of France. Stepping on a concealed trigger released a 
flow of water that made statues move. Descartes saw a similarity between such 
devices and the reflex. According to Descartes, stimuli were comparable to 


Strangers, who entering into one of the grottoes containing many fountains, 
themselves cause, without knowing it, the movements which they witness. 
For in entering they necessarily tread on certain tiles or plates, which are so 
disposed that if they approach a bathing Diana, they cause her to hide in the 
Tosebushes, and if they try to follow her, they cause a Neptune to come 


forward to meet them threatening them with his trident. (Descartes, trans- 
lated in Fearing, 1930, pp. 20-21) 


Just as a step on the concealed plate triggers the movement of a statue, a stimulus 
triggers a response. For Descartes, the role of pipes and water in these statuary 
Systems was played by nerves and animal Spirits in living organisms. 

For our purposes, the most important part of Descartes’ concept of the reflex 
was that the term incorporated the observation that behavior is sometimes caused 
by environmental events. Thus, a child who touches a flame will quickly withdraw 
its hand. The development of physiology as a science turned attention toward the 
mechanism of such behavioral relations, and experiments explored the components 
of the reflex arc, the path from the Original sensory impact of the stimulus through 
the central nervous system and then back to the muscular or glandular system 
within which the response occurred. The analysis of the reflex became more and 
more sophisticated (e.g., Sherrington, 1906), and in the conditioned-reflex concepts 


of Pavlov (1927) and the related behaviorism of Watson (1919), it became the basic 
behavioral unit (see also Phillips, 1971). 


| We noted two illustrations of reflexes above, 
withdrawal from a flame. Many others are familiar: 
tap on the patellar tendon; salivation produced by 


constriction produced by bright light; the postural 
abrupt loss of support. 


in the startle reaction and in 
the knee jerk produced by a 
food in the mouth; pupillary 
adjustments produced by an 
common feature that some 
S the defining property of a 
se by a specified stimulus. In 
its the response, or that the 
response is an elicited Tesponse. ANSTO Stim, an dR 


Elicitation 
41 


Observed a response without an identifiable stimulus. By itself, the response of 
pupillary constriction is not a reflex. We should not confuse pupillary constric- 
tions produced by bright light with those produced in other ways. The pupillary 
reflex is defined by the relation between bright light and pupillary constriction, 
and reflexes in general are defined by the relation between eliciting stimuli and 
elicited responses. Many reflexes have been named for their characteristic responses; 
the salivary reflex and the patellar reflex are examples. But it is useful to remember 
that these are not names for responses; if we observed salivation or knee jerks in the 
absence of eliciting stimuli, the reflex terminology would be inappropriate. 


PROPERTIES OF ELICITED BEHAVIOR 


Once we identify a reflex relation between some stimulus and some response, we 
can examine properties of the reflex (cf. the Laws of the Reflex; Skinner, 1938). 
Consider an eliciting stimulus such as an acid or sour-tasting solution on the tongue 
(e.g., vinegar) and an elicited response such as salivation. Above some minimum 
Value, called the threshold, acid on the tongue will elicit salivation, but a very low 
concentration or a very small quantity may not do so; when the magnitude of a 
Stimulus is not sufficient to elicit a response, the stimulus is said to be below 
threshold. As we concentrate the solution or increase its quantity, or both, we will 
reach values that are more and more likely to produce salivation. At a given 
concentration, we can find out how many drops must be placed on the tongue to 
elicit salivation reliably; conversely, for a given number of drops, we can find out 
what concentration is required. Some concentrations and quantities will have only 
marginal effects, sometimes eliciting salivation and sometimes not. 

Note that a stimulus does not itself have a threshold. Rather, we determine 
threshold values for particular features of stimuli. When we do so, we hold other 
features of the stimulus constant. For example, in eliciting salivation, the threshold 
number of drops would typically be smaller for a strong acid solution than for a 
Weak acid solution; it would therefore not be meaningful to specify the threshold 
number of drops without also specifying the acid SOfcEnUrA On; or Vice versa. 

Elicited responses have varied properties. Some time must elapse between 
stimulus and response; this time period is called tiie latency of the response. In 
addition, the response must exist in some magnitude and have some duration. 
Because these properties may Vary together, they have sometimes been given a 
common name, reflex strength. Thus, reflex strength would be weak if responding 
Occurred with long latency, small magnitude and short duration, Whereas it would 
be strong if responding occurred with short latency, large magnitude and long 
duration. | Hl 

The eliciting effect of a stimulus does not involve any direct transfer of 

the organism. Sechenov (1863), a Russian 


energy from the environment to e ( 
Physiologist, noted that the organism’s energy expenditure in many responses 
(e.g., sneezes and coughs) far exceeds the energy provided by the eliciting stimulus. 


Thus, observed Sechenov, the stimulus should be regarded as a kind of trigger, 
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releasing energy that the organism already has available in muscles or glands or 
other structures. This view was, of course, consistent with Descartes’s conception 
of the reflex. 

Even though the eliciting stimulus is most accurately regarded as a trigger 
that releases energy already available in the organism, the magnitude of the eliciting 
stimulus may affect the elicited response. Response latency typically varies 
inversely with stimulus magnitude; in other words, response latency decreases as 
stimulus magnitude increases. And response magnitude and response duration 
typically vary directly with stimulus magnitude; in other words, these measures 
increase as stimulus magnitude increases. Saying that reflex strength increases with 
stimulus magnitude simply summarizes these relations. 

Once it became common to speak of reflex relations in terms of strength 
rather than in terms of specific measures, it also became easier to regard the reflex 
relation as a fundamental unit of behavior: reflexes began to be treated as building 
blocks from which more complex behavior was constructed. The concept of the 
reflex had a tempting simplicity, and as it came to be more widely accepted as a 
behavioral unit, it seemed reasonable to conclude that reflex relations might be a 
basis for understanding a variety of behavioral processes. Some stimuli had been 
identified as causes of some responses, and the faulty assumption was made that 
for every response there must exist a corresponding eliciting stimulus. 

Pavlov’s conditioned-reflex system and Watson’s behaviorism of the 1920s 
and 1930s grew out of such an assumption. With the reflex serving as a unit of 
behavior, complex behavior was then treated as the combination of reflex units. 
When a response occurred with no observed eliciting stimulus, stimuli with appro- 
priate properties were hypothesized. In addition, the responses of one reflex were 
assumed to have stimulus properties that enabled them to elicit other responses in 
turn. Thus, behavior that extended over long periods of time could be interpreted 
4s a sequence or chain of reflexes, in which each response functioned 
simultaneously as the elicited response of one reflex and the eliciting stimulus of 
the next. These reflex Systems were elaborated in various ways, but despite the 
ingenuity and the dedication of their proponents, they no longer command 
substantial attention in the Psychology of learning. The concept of the reflex has a 


place in the analysis of behavior, but by itself it is not adequate for thoroughly 
describing what organisms do. 


Eliciting stimuli and response probabilities 


Of the possible relations between stimuli and responses, the reflex is one 
Specialized relation among many. In a reflex, the presentation of some stimulus 
reliably produces some response. But the stimulus of that reflex may affect other 
responses differently, and the response of that reflex may be affected differently 
by other stimuli. For any particular stimulus, its presentation may increase the 
likelihood of some responses, decrease the likelihood of others, and have no effect 
On still others. For any particular response, its likelihood may be increased by some 
Stimuli, be decreased by others, and be unaffected by still others (Catania, 1973). 
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Furthermore, it is not enough simply to note that a response reliably follows a 
Stimulus to conclude that the language of the reflex is appropriate. We must also 
know how likely the response is without the stimulus. For example, if a rat in a 
Tunning wheel spent much of its time running and it ran in noise as well as quiet, 
We would not want to say that noise had elicited running just because we observed 
the rat running when we presented noise. To specify the relation between a 
Tesponse and a stimulus we must know how likely the response is in the absence 


of the stimulus as well as in its presence. 
Consider watching a dog. The dog m 


নথ ears or bark. If we give the dog some foo 
egins. If we then introduce a strange dog, the eating will stop and the first dog 


may grow] and assume an aggressive posture. If the other dog is removed and the 
rst dog’s master enters, we may observe barking and jumping and tail-wagging. 
But if the master now scolds, the dog may whine and its tail may drop down 
between its legs. The list of possible effects of various stimuli could easily be 
eXtended, but these examples are sufficient. The food, the strange dog, the master 
And the master’s scolding each make some responses more likely while they make 
Others Jess likely. Some of the responses might be observed from time to time even 
1 the absence Df these stimuli, and none will necessarily occur every time a 
Particular stimulus is presented. In a reflex relation, a response that is infrequent in 
the absence of some stimulus occurs reliably when that stimulus is presented. As 


‘le above examples indicate, this class of stimulus-response relations is only one 
ulus-response relations. The response of bark- 
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and responses can be described best in quantitative terms. We must introduce the 
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Here, p stands for probability, and the terms in the parentheses are abbreviations 
for the events whose probabilities are specified in the equation (in this instance, 
R for responses, where the subscripts 1, 2, and 3 correspond to the three successive 
measurements of the Babinski reflex). These examples also illustrate that probabili- 
ties are defined in such a way that they can have values only within the range from 
0.0, when the event never occurs, to 1.0, when the event always 0ccurs. 

Conditional probabilities. The Babinski reflex is an uncomplicated example 
for our purposes, because the toe-splaying response does not occur often in the 
absence of its eliciting stimulus. Suppose, however, that we study the blinking 
elicited by a puff of air to the eye. If we study this reflex with an infant whose 
blinking already occurs at the rate of a blink every one or two seconds, how can we 
tell elicited blinks from those that would have occurred even in the absence of the 
air-puff? In fact, we may never be able to say whether or not any particular blink is 
elicited. But by comparing the probability of a blink after an air-puff with the 
Probability after no air-puff, we may at least be able to assess the effect of the 
stimulus. Our procedure is illustrated in Figure 3-1. We watch the eyelid and record 
blinks within a period of time after each stimulus, and we compare responding in 
these time periods with responding in equivalent time periods that do not follow 
a stimulus. 
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Figure 3-1. Estimating eye-blink probability with and without an eliciting puff of 
air. Each solid vertical line represents a blink. In Line A , nO air-puffs were presented, 
and the dashed lines mark off five one-second periods during which blinks were 
recorded. Blinks occurred within three of these five periods, and the probability of 
a blink in the absence of a stimulus is therefore 0.6. In Line B, each arrow 
represents a puff of air to the eye. An eye-blink occurred in each of the five 
one-second periods that followed these stimuli. The probability of a blink given an 
air puff is therefore 1.0. The two blinks produced by stimulus X, in time-period 3, 
did not both count toward this probability; a time period is scored either as one in 
Which no responding occurred or as one in which at least a single response occurred, 
and probability is then calculated by dividing the number of time periods with at 
least one response by the total time periods. These data can be summarized by 
saying that the air-puff raised the probability of an eye-blink from 0.6 to 1.0. (This 


example is hypothetical; an actual experiment would use a much larger sample of 
Observations to estimate probabilities.) 
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The probabilities in this example are expressed in the following notation: 
D(R/S) = 1.0; P(R/S) = 0.6. 

In this notation, Ris a response, S is stimulus, and a bar over a symbol (as in 5) is 
A negative sign, equivalent to no or not. Thus, the first equation may be read as: 
Probability of a response given that a stimulus has been presented (eye-blink given 
alr-puff) is 1.0. The second equation may be read as: probability of a response given 
that a stimulus has not been presented (eye-blink given no air-puff) is 0.6. This kind 
Of probability, in which the probability of one event is specified in terms of the 
Presence or absence of another event, is called a conditional probability (the 
terminology should not be confused with that of Pavlov’s conditioned or 
conditional reflexes, even though both cases have the common feature that one 
event is a condition for some other event). Within the parentheses, the term to the 
left of the slash represents the event whose probability is measured, and the term 
to the right represents some other event upon which the first is conditional. In 
Other words, D(A/B) can be read as: probability of A given B; or, probability of A 
On the condition that B has occurred. Similarly, p(X/Y) can be read as: probability 
of x given not-Y; or, probability of X on the condition that Y has not occurred. 

Later discussions will favor the language of probabilities over other ways of 
describing behavior, but usage in psychology varies enough that such choices of 
Vocabulary will often be arbitrary. There is not much difference between saying a 
Stimulus produced a response and saying a stimulus elicited a response. These usages 
Would be applied most often to single instances (e.g., this particular presentation of 
Stimulus M was followed by this particular instance of response N). Neither is there 
Much difference between saying a stimulus increased response likelihood and saying 
8 Stimulus raised response probability. These usages would most often be applied 
to the average effect of a stimulus over many instances (e.8., Stimulus O is usually 


fi 
Ollowed by response P). 


Types of stimulus-response relations 

Now we may summarize some relations between stimuli and responses in 
terms of conditional probabilities. For this purpose, we will use the coordinate 
System in Figure 3-2. The y-axis is a scale of response probability given that a 
Stimulus has been presented; the x-axis is a scale of response probability given that 
the stimujys has not been presented.’ In other words, any point on this graph 
tepresents two conditional probabilities: response probability given a stimulus, 
D(R/S), ang response probability given no stimulus, P(R/S). For example, the point 
labeleq Ain the left rr ph of Figure 3-2 represents a stimulus-response relation in 
Which response probability is 0.75 when the stimulus is presented and only 0.50 
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Figure 3-2. Various stimulus-response relations represented in terms of response 
probability in the presence of a stimulus, p/R/S), and in the absence of the 
stimulus, p(R/S). The left graph shows examples of stimulus-response relations in 
which a stimulus raises response probability (A), has no effect on response 
probability (B), or lowers response probability (C); the increase in response 
probability called a reflex, in which the stimulus reliably produces the response, 1S 
illustrated at D. The right graph shows possible stimulus-response relations that 
might be observed with various combinations of responses (a dog’s salivation, growl- 
ing or tail-wagging) and stimuli (its master, food or a strange dog). This graph makes 
the point that stimuli can have different effects on different responses, and that 
responses can be differently affected by different stimuli. 


when the stimulus is absent; in this instance, the stimulus raises the probability of 
the response. 

The diagonal in these graphs is of particular interest. A response that occurs 
without being elicited by a stimulus is said to be emitted. The diagonal represents 
Sstimulus-response relations in which response probability is independent of or is 
unaffected by the stimulus. Thus, at point B response probability is 0.25 whether 
Or not the stimulus is presented. In such cases, it would be inappropriate to say that 
a stimulus produced or elicited the response. To say a response has been elicited it 
is not enough to know that the response followed a stimulus. A response can 
happen to be emitted just after a stimulus is presented. Thus, when a response 
follows a stimulus, we cannot say that the response was elicited unless we know 
that the stimulus actually produced or caused the response; otherwise, we must say 
that the response was emitted. 

A third class of stimulus-response relations is illustrated by point C,, for which 
the response probability of 0.90 when the stimulus is absent is reduced to 0.10 by 
the stimulus; in this instance, the stimulus makes response probability decrease. (In 
some behavioral vocabularies, such reductions Of response probability by a stimulus 
are called reflex inhibition; for example, if a dog’s tail-wagging stopped with the 
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we might say that the strange dog inhibited 
f the diagonal represent cases in 
whereas those below and to the 


Introduction of a strange dog, 
tail-wagging.) All points above and to the left 0 
Which a stimulus increases response probability, 
Tight represent cases in which a stimulus decreases response probability. 

Among those cases in which a stimulus raises response probability are those in 
Which virtually every stimulus presentation produces the response. For example, at 
Point D a stimulus raises response probability from about 0.1 to about 1.0. This is 
the kind of stimulus-response relation we have been calling a reflex. The graph 
demonstrates that this relation is only one extreme in a range of possible stimulus- 
tesponse relations. Somewhere between the upper edge of the graph, where D(R/S) 
ls near 1.0, and the diagonal, where S has no effect on R, we might decide that the 
eliciting effect of the stimulus is no longer reliable enough to justify calling the 
Stimulus-response relation a reflex. But where should the boundary be? We probably 
Would not exclude cases in which p(R/S) is just a little less than 1.0 (e.g8., 0.98, or 
Perhaps even 0.95). But neither would we want to include very Slight effects of 
Stimuli as reflexes. For example, would we wish to speak of a stimulus-response 
relation as a reflex if a response with a probability of 0.2 were raised by a stimulus 


only to 0.21, or even to 0.25? j f 
Any boundary we could draw to separate those SEimulisTesponss relations 
We cal] reflexes from those we do not will be arbitrary. This is an important conclu- 
Sion. We saw how some earlier behavioral systems, such as those of Watson and 
Pavioy, based their development on the reflex as a fundamental unit of behavior. 
0 doubt highly reliable reflex relations were easier to work with than other, less 
teliable stimulus-response relations. But if Figure 3-2 is appropriate for describing 
Stimulus response relations and if the reflex is only a special case among these 
telations, (hen any system of behavior built solely on the reflex as a behavioral 
Unit was ing out. Re 
one" pee TE out of early accounts ৬ পলা ্ 
Tesponses, the occurrence of responses in the absence of eliciting stimull. mitte 
tesponses were given such names as instrumental or operant ad because 
Cy were studied in terms of how they were instrumental in producing changes in 
the environment how they operated on the environment. They derived their 
portance Fa ee their relation to eliciting stimuli but rather their 
COnsequences, By contrast elicited behavior was called I Cf কাল 
S distinction had been made, other features Were a ee GE NA 
Argueq that instrumental or operant behavior consisted of ske A al response ঢা 
S movements of the limbs, whereas reflex Or respondent be A ke 
Mtonoimic.esnonses. such Hs glandular secretions. This ie ER also seen 
as Paralleling tie traditional distinction between or ডে RR 
Ty recently have these distinctions been seriously ques ETS a Pp Le 
Can be elicited and glandular responses can be emitted. It is i a EE nr 
€ distinction between elicited and emitted responding. But we have seen that the 


Same ited and sometimes emitted; thus we cannot 
Tespons ometimes elicited an pe 
dlassify Ponse may be s ‘to these two categories on the basis of such 


responding effectively in 
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physiological properties of behavior as the difference between skeletal and 
autonomic responses. 

Now let us return to Figure 3-2, to consider specific cases of stimulus-response 
relations. The right graph of Figure 3-2 shows hypothetical relations involving three 
stimuli and their effects on three responses. A dog’s salivation and growling and tail- 
wagging are represented by three locations on the x-axis, and the master, food or 
a strange other dog are represented by filled circles, unfilled circles, and unfilled 
triangles, respectively. The probability of salivation is raised both by the master 
and by food but is lowered by the strange dog; these stimuli have similar effects on 
tail-wagging, except that food elicits salivation more effectively than does the 
master whereas the master elicits tail-wagging more effectively than does food. 
Conversely, growling becomes more probable in the presence of the strange dog and 
less probable in the presence of the master or food; we might also say that food has 
a larger inhibitory effect than the master. 

This graph supports an earlier point. When we defined the reflex, we argued 
that the reflex is neither stimulus nor response but rather the relation between 
them. The graph represents stimulus-response relations; it cannot represent stimuli 
by themselves or responses by themselves. Thus, food as a stimulus does not have 
any single location in the graph; its location depends on the response we are 
measuring. Similarly, tail-wagging as a response does not have any single location 
in the graph; its location depends on the stimulus we are presenting. 


THE TEMPORAL STRUCTURE OF ELICITED BEHAVIOR 


We have seen some complexities of analyzing the relations between stimuli and 
responses. Of the possible relations between stimuli and responses, the reflex is 
one specialized relation among many. In a reflex, a specified stimulus reliably 
produces a particular response. But the stimulus of that reflex may affect other 
responses differently, and the response of that reflex may be affected differently 
by other stimuli. For any stimulus, its presentation may increase the likelihood of 
some responses, decrease the likelihood of others, and have no effect on still others. 
For any response, its likelihood may be increased by some stimuli, decreased by 
others, and unaffected by still others. We now move on to examine how elicited 
responding changes over time. 


Elicitation and response sequences 


Presenting a stimulus may determine not only whether a response occurs at 
a particular moment but also the sequence of responses that occurs over an 
extended time. For example, if a rat is deprived of food and we give the rat a food 
pellet, the rat will eat the pellet. Once it has finished eating the pellet, the rat will 
then typically drink water if water is available. This relation between eating and 
subsequent drinking is so strong that by delivering food in small amounts over an 


extended period we can make the rat drink several times its ordinary daily ration 
of water (Falk, 1961, 1971). 
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Adjunctive behavior. Falk named this increase in drinking polydipsia, and he 
called responding that depended in this way on other responding adjunctive 
behavior. Adjunctive behavior, therefore, is behavior in which one response reliably 
accompanies some other response produced by a stimulus. Polydipsia, as one type 
of adjunctive behavior, follows at least partly from the way in which eating and 
drinking are related in the rat’s normal feeding pattern. If food and water are 
freely available, the rat ordinarily takes a few large meals daily and drinks after each 
meal. If we then force the rat to take many small meals by delivering food in many 
Small portions at intervals of one or a few minutes, the rat still drinks after each 
meal but does not reduce the size of each drink enough to compensate for its more 
frequent drinking. Thus, a rat that went from five large meals to fifty small meals 
per day would now drink ten times as often. But if drinks taken after each of the 
fifty Small meals were only half instead of one-tenth the size of drinks taken after 
each of five large meals, the rat would drink five times as much as before. Such 
Increases in the rat’s daily water intake can be readily produced, and polydipsia is 
39 reliable in rats that it has been used to get rats to ingest substances that they 
Ordinarily reject (e.g., alcohol: Meisch and Thompson, 1971). 


Adjunctive behavior includes other responses besides eating and drinking. 
at instead of water, running in the 


y as drinking follows eating in 
ample involves aversive stimuli. 
he monkey will typically bite 
po thing nearby it can sink its teeth into. The electric shock, in other words, elicits 
‘ting. Once the biting stops, the monkey then will typically engage in manual 
tesponses, such as grabbing rid shaking levers, chains or other manipulable objects 
utchinson, Renfrew, and Young, 1971). The manual responses may be regarded 
I adjunctive behavior relative to the shock-elicited biting. ae d 
Interim and terminal behavior. The presentation of stimuli may impose 
temporal order or structure on behavior. A stimulus may generate a sequence of 


ferent responses. Figure 3-3 illustrates 8 that examined patterns of 
“lavior generated in pigeons by repea 

‘mmelhag, 1971). Each Eh was placed in a chamber on one wall of which was 
An opening to a feeder. The feeder consisted of a tray of grain ordinarily out of the 


Pigeons E jshting the tray and lifting it to a position 
reach; food was presented by lig aes a pigeon’s behavior during 


Where t 5 sn Fi 3-3 

Nia on successive 2-second food deliveries. 

Sco he experimenters observed the pigeon through a window in SRE GU 

red responding in various categories. Some categories were mov. , such a: 

Pecks directed toward the feeder wall or toward the floor; others were postures 
at could be maintained for some time, such as orientation toward the feeder wall. 
© Categories were determined by watching many pigeons for টে ES 

Figure 3.3 shows that early in each 12-second interfood perioc, Bir 


Usuayy but less frequently, put its head into the 
feeqe jgeons showed patterns that 


ing. These responses became 
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Figure 3-3. Probability of a pigeon’s responses in successive seconds of a 12-second 
period between successive food deliveries. Food deliveries consisted of 2-second 
access to grain in a food hopper located in one of the chamber walls. Data are 
based on the last three of 109 sessions of 64 food deliveries each. Orientation 
toward the feeder wall, not shown, increased in probability over each 12-second 
period. Low-frequency responses such as wing-flapping and preening are also 
omitted. A sequential response pattern is implied by the different times at which 


each response reaches its maximum probability. (Adapted from Staddon and 
Simmelhag, 1971, Figure 1) 


less likely and pecking became the dominant or most likely response as the time of 
the next feeder presentation approached. Staddon and Simmelhag called the varied 
responding early in each period interim behavior (in Figure 3-3, quarter-turns, 
head-in-feeder and floor-directed pecks). The more consistent responding later in 
each period, which typically included some form of pecking, was called terminal 
behavior (in Figure 3-3, wall-directed pecks). 

The original purpose of Staddon and Simmelhag’s experiment was to evaluate 
an earlier experiment on repeated food presentations by Skinner (1948). Skinner 
Observed that pigeons tended to repeat those responses that had just preceded 


earlier food presentations, and referred to this repetition as an experimental super- 
stition. 
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The conditioning process is usually obvious. The bird happens to be executing 
some response as the hopper appears; as a result it tends to repeat this 
response. If the interval before the next presentation is not so great that 
extinction takes place, a second “contingency” is probable. This strengthens 
the response still further. ... The experiment might be said to demonstrate 
a sort of superstition. The bird behaves as if there were a causal relation 
between its behavior and the presentation of food, although such a relation 


is lacking. (Skinner, 1948, pp. 168-171) 


Skinner noted that the pigeon’s responding often changed gradually as the 
Procedure continued. He referred to such gradual changes in responding as topo- 
8raphical drift because they depended on accidental relations that developed 
between responding and food deliveries; the procedure did not seem to force 
Changes in the topography or form of responding in any particular direction. 
Although a pigeon might respond temporarily as if its responses Were producing 
food as a consequence, its responding would change or drift over the long run, and 
10 particular response could be identified consistently as superstitious behavior. 
Ser longer periods of observation, Staddon and Simmelhag observed that pecking 
eventually dominates as the response just preceding food deliveries in this kind of 
Procedure. 

Superstition remains somewhat controversial, and we will return to it later. 
For the moment, our concern is with the temporal structure that repeated stimulus 
Presentations impose on behavior. Responding early in interfood intervals, interim 

Chavior, varied from one pigeon to another, whereas later responding, terminal 
behavior, was fairly constant across pigeons and usually included some form of 
Pecking. Pecking is also the behavior occasioned by food presentations; the bird 


takes food into its mouth by pecking. At least in this case, terminal behavior seems 
to have something in common with responses produced by stimulus presentations. 
leash, electric shocks will elicit biting 


imilarly, if a monke lar with a 
: y wears a collar টন cb 
and i tations make the biting and 
pulli ted shock presen 
ng at the leash, but repe8 ° eceding each shock: Morse, 


Pulling oc El ‘7 the time just Pr 
cur as terminal responses, in the time J ! 
teag, ang Kelleher, 1967) je would be nice to conclude that repeated stimulus 


Presentations not only elicit responses but also produce terminal behavior closely 
elated to the elicited responding. Unfortunately, behavior is not that simple. 
searchers have not yet settled on how Or even on whether the classes of behavior 
ere called adjunctive or interim and terminal or SUDETST ET should be 
Stinguished. The problem is not about the facts of behavior, but about how these 


aq 
ts should be organized. 


Effects of successive elicitations 
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t one presentation of a given 
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h of lightning in a thunder 
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ns may not have the same effect as 
example, we may startle much more 
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storm than to later flashes of lightning. Furthermore, the effects of stimuli may 
depend on how quickly they follow each other. For example, if we are peeling 
Onions, the tears elicited by the present onion may depend on whether we began 
working on it immediately after finishing the last one or had just returned after 
taking a five-minute break. In other Words, to describe how stimuli can change 
behavior, it will not be enough merely to list the changes in responding produced 
by a stimulus. Elicited responding may depend on the number of stimulus presenta- 
tions or their spacing in time. This may be the point at which elicitation first 
becomes relevant to learning. If we show only that some stimulus elicits some 
response, we have not demonstrated learning. But we may wish to begin speaking of 
learning if the responding elicited at one time depends on what happened to the 
Organism earlier. 

Adaptation or habituation. We already mentioned the startle reaction pro- 
duced by an unexpected event such as a lightning flash or a sudden loud noise. Even 
without other events that signal its occurrence, a repeated loud noise will produce 
Successively smaller startle reactions, until eventually no behavior at all is 
Observable when the noise occurs. Many stimuli elicit responses called orienting or 
Observing responses; for example, a dog pricks up its ears in response to a novel 
sound or begins sniffing in Tesponse to an unusual odor. With repetitions of these 
stimuli, however, the dog’s responding decreases; the responding occurs with 
smaller magnitude and longer latency, until perhaps it vanishes completely. 

This decrement in responding with repeated stimuli has been given various 
names. For the stimulus-response relations called reflcxes, this decrement has been 
called adaptation or habituation. The terms have complicated histories, and the 
distinctions between them remain ambiguous. Both physiological and behavioral 
considerations enter into their usages, and the different terms are invoked on the 
basis of such circumstances as whether the observed decrements in responding can 
be attributed to specific kinds of changes in the nervous system (cf. Groves and 
Thompson, 1970). For our purposes, the terms adaptation and habituation are 
interchangeable. 

Adaptation or habituation is a characteristic of the elicited responding pro- 
duced by a variety of stimuli. It has been studied with such diverse responses as the 
change in skin resistance produced by electric shock (the galvanic skin response or 
GSR), distress calls of birds to the silhouette of a predator passing overhead, and 
contractions in earthworms produced by exposure to light, not to mention the 
Startle reactions and Orienting responses mentioned earlier (e.g., Ratner, 1970). It 
may also be an important component of the dynamics of emotion (R. L. Solomon 
and Corbit, 1974). 

Facilitation or potentiation. But stimuli may sometimes have opposite 
effects. For example, electric shocks elicit squealing in rats; if several shocks are 
delivered, later Presentations produce more responding than earlier ones (e.£., 
Badia, Suter, and Lewis, 1966). This effect has been called facilitation or potentia- 
tion. Either is an acceptable term for increased responding with repeated presenta- 
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tions of a single stimulus.> The process of facilitation or potentiation seems more 
characteristic of stimuli regarded as aversive or punishing than of stimuli regarded 
either as neutral or as appetitive or reinforcing. 

The method of stimulus presentation can determine whether adaptation 
(habituation) or facilitation (potentiation) is observed. Kimble and Ray (1965) 
Studied a wiping reflex in the frog: a bristle touched to a frog’s back elicits a move- 
ment over the back by the hind foot. In one group of frogs, successive touches were 
made within a specified region of the frog’s back each time, but within this region 
the exact location of stimulation could vary from one touch to the next; in a 
Second group, successive touches were made to precisely the same location on the 
frog’s back cach time. In both groups, 100 touches at l10-sec intervals were 
delivered each day for 12 consecutive days. The two groups began with roughly 
equal probabilities of elicited wiping movements, but this probability increased over 
Successive days for the first group whereas it decreased for the second group. In 
other words, facilitation or potentiation occurred when the location of the eliciting 
touch varied slightly from presentation to presentation, whereas adaptation or 
habituation occurred when the location of the eliciting touch was constant. 

In another example, Hutchinson, Renfrew, and Young (1971) examined 


biting responses elicited in squirrel monkeys by electric shock. When shocks were 
delivered at 30-sec intervals, the elicited biting responses gradually decreased over a 
60-min session. When shocks were delivered at 120-sec intervals, these responses 
temained roughly constant over the session. And when shocks were delivered at 
240-sec intervals, responding gradually increased over the session. Thus, the same 
Stimulus, depending on its frequency of presentation, could elicit responding that 
demonstrated adaptation, facilitation, or neither of these phenomena. { 
Effects of time since the last eliciting stimulus. Repeated presentations of 
a stimulus change the extent to which that stimulus elicits responses. If the stimulus 
is then absent for some time, the probability that responding will be elicited may 
return to earlier values. For example, the startle reaction to loud noise may 


diminish or even disappear after several loud noises in succession over a period of 
a few minutes, but may appear again in full strength if a loud noise later follows 
hours of ence Such an effect of the time between stimuli follows from the 
Nature of adaptation and facilitation. If elicited responding decreased over 
uA lier higher levels after the 


Successive stimuli, we would expect it to recover to its earlier 
Stimuli were discontinued. Conversely, if elicited responding increased over success- 


ive stimuli, we would expect it to return to its earlier lower levels after the stimuli 
Were discontinued. The changes in responding called adaptation and facilitation are 
not permanent, and the return to earlier levels takes place as time passes. In fact, 


onally is sensitization, but sensitization has an alternative 


And more widely accepted usage. It refers to an increase in the eliciting effect of one stimulus 
aS a result of presentations of some other stimulus; one stimulus amplifies the eliciting effect 
of another stimulus (for example, a flash of light may make it more likely that a later loud 


noise will produce a startle reaction). 
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FACILITATION or POTENTIATION 


Response Probability or Reflex Strength——> 


A 


¥ z 
j 
ks Time —— Zz 


Successive stimuli. In both 
time since the last stimulus 


this temporary nature of the decrease or increase in response Probability is part of 
the definition of adaptation and facilitation. 


These relations are Summarized in Figure 3-4. Each line shows hypothetical 
effects on response probability or reflex Strength of ten successive stimuli. The 
Upper part of the figure (4, B and C) illustrates adaptation or habituation; the 
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lower part (D, £ and F) illustrates facilitation or potentiation. In both cases, three 
examples are given that differ only in the time separating the first five stimuli from 
the second five stimuli. As this time becomes longer (from A to C and from D to 
F), the responding produced by the second five stimuli becomes more like the 
responding produced by the first five stimuli. In other words, responding returns to 
its earlier levels with the passage of time. For example, in A responding is elicited 
by the later stimulus x with a lower probability than by the first stimulus w. But 
When stimulus x is presented after successively longer periods without stimuli, as 
in B and C, the probability with which stimulus x elicits responding approaches that 
with which stimulus w elicited responding. Analogous relations exist for stimuli y 


andzinD,E andF. 


The role of exercise 


We argued above that adaptation and facilitation are not ordinarily 
permanent. If decreases in responding were irreversible, we either could Observe 
adaptation only once in an organism's lifetime, or else successive adaptations would 
drive responding to lower and lower levels until it finally disappeared altogether. 
A parallel case for facilitation would similarly lead either to a single instance of 
facilitation in the organism’s lifetime or to LEE হা unlimited increases in 
elicited responding. None of these alternatives seems plausib/e. p 

Yet ne the following experiment by Dill (1974) with a tropical fish, the 


5 io’: from a stimulus 
Zebra i j ed the zebra danio s escape response a St 
Yb: REST fa larger predatory fish. As this stimulus 


designe rapid approach 0 3 L 
le RL AE from it (despite its aquatic medium, this 


approached, the danio began to swim away ্‌ 
swimming response of the fleeing fish could aptly be called a flight reaction). The 


latency with which swimming began (or the threshold for size or rate of 
the Stimulus) decreased with repeated experiences; this is Se to re য 
the stimulus more and more strongly elicited the escape response. Ut a teste 

after 10 days without further elicitations, He resp had SEE 
earlier values. The change produced by the stimulus was not i) i 
Or potentiation are therefore not appropriate names for-theveffect.How then shall 


We treat it, and what are its implications? k | 

“Practice makes perfect” isa familiar sayin 
behavior. Before the effects of the consequences ye 
Was believed that the mere repetition of respond 
quences, was sufficient to maintain behavior. 
Sechenov: 


g about the role of repetition in 
f responding were appreciated, it 
ing, without regard to its conse- 

Consider the following from 


) 2 ugh, sneeze and swallow immediately upon birth. 
The act LL HE elores to this category HE SEA i 0 
Indeed, everybody knows that a new-born child is able suck. টি - Moreover, 
it is a well-known fact that the activity of this comp ex anism in the 
infant is called forth by irritation of the lips; re a eh SE finger, 
OF a candle, ora wooden stick betwee, the child's 5 it will begin to 
suck. Try to do the same witha child three months after it has been weaned, 
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i i longer do so; however, the ability to produce sucking 
TEE ক retained by man for life. These are highly IT 
facts: on the one hand, they show that the conduction of sensation Es 
lips to the central nervous mechanisms which produce the ET 
apparently ceases in the child after weaning; on the other hand, they in or 
that the integrity of this conduction is maintained by the frequent repe 
of the reflex. (Sechenov, 1863, pp. 28-29) 


Sechenov here emphasizes the complexity of the response in the sucking 
reflex, but more important is his observation on the role of repetition. Not RR i 
repetition important in maintaining the response, according to Sechenov, et 
response also becomes independent of the effects of eliciting stimuli. In the infan 
sucking is elicited by stimuli (“irritation of the lips”); later, these stimuli no longe 
elicit the response, but the organism through adulthood remains able to 2 
the response even in the absence of these stimuli (cf. Schoenfeld, 1966, Segal, 
1972). In our vocabulary of stimulus-response relations, we might say that the 
repeated elicitation of a response increases the likelihood that the response will be 
emitted. 

Early accounts of learning (e.g., Thorndike, 1913; cf. Verhave, 1967) treated 
effects of response repetition as basic components of learning, described in terms 
of laws of exercise or laws of practice. These laws were often ambiguous on such 
questions as whether it mattered if the repeated response were elicited or emitted. 
In any case, they soon were overshadowed by other concerns. As the psychology 
of learning turned to other phenomena, such as reinforcement and stimulus control, 
the possibility of a role for exercise or practice became neglected. The evidence is 
too scanty to allow firm conclusions about whether exercise or practice might be 
a basic component of learning. Nevertheless, some tantalizing data exist, and some 
Speculation is therefore in order. 

Some of the earliest responses in an Organism’s lifetime occur spontaneously, 
€ absence of identifiable eliciting stimuli. For example, inside its egg the 
Yo chick makes uncoordinated movements of its limbs and body. These move- 
ments may prevent the developing bones from becoming 
Or may modify the form of the growing bones and connect 
embryo’s development, eliciting effects of stimuli appear, 
embryo’s sensory apparatus matures. This progression from Spontaneous responding 
to elicited responding may be summarized by saying of chick embryos that “they 


‘act’ before they ‘react.’” (Provine, 1976, p. 210). Stimuli become more important 
later, as when the chick’s 


rotating movements in pecking its way out of its shell 
during hatching are affected by whether the chick continues to encounter intact 
portions of the shell. 
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fixed in their sockets, 
ive tissue. Later in the 
perhaps simply as the 


Evidence suggests that pecking in the Young chick depends not only on the 


S and the consequences of earlier pecks but also on 
how much pecking the chick has already en. 
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chick. These differential consequences will affect the accuracy of the chick's later 
pecking only as the response becomes independent of its eliciting stimuli. That 
development corresponds to one possible formulation of a law of exercise: once a 
response has been elicited by a stimulus, the response may become more likely even 
in the absence of the stimulus? (cf. Spalding, 1873, on the relation between instinct 
and learning). 

In experiments on salivation, dogs at first salivate only when food is 
Presented, but after several food presentations they begin also to salivate 
Occasionally when food is absent (e.g., Zener and McCurdy, 1939). “Such 
responding, called spontaneous salivation, has been attributed to conditioning of 
the salivary response to features of the experimental setting. For example, it might 
be argued that the eliciting stimulus was a spot on the wall that the dog just 
happened to see when food deliveries began. No other eliciting stimuli are identifi- 
able, so the only way to salvage the treatment of salivary responses as part of a reflex 
relation is to assume that salivation could be elicited by such arbitrary features of 
the experimental setting. Yet the problem of identifying a stimulus simply a 
Once we admit the possibility that responses can occur without eliciting stimu a 
fact, we cannot otherwise conceive of emitted responding at all. The concept 0 
emitted responding will be essential to our treatment of consequential operations 
Such as rei unishment. | 
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im ing shock. 
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nO longer be sure of the evidence upon which Sechenov based his conclusion (e.g., 
did he observe one child or many children?). But we can consider a contemporary 
example, in Zelazo, Zelazo and Kolb (19723, b). hl 
These investigators worked with the walking reflex, which appears in 
newborn infants and disappears at about eight weeks. The response of the walking 
reflex has much in common with the behavior that the child engages in later when 
learning to walk. Coordinated walking movements resembling those of an adult 
can be elicited by holding the infant under the arms and letting its feet touch a 
level surface. In one of four groups of six male infants each, parents exercised their 
infant’s walking reflex by holding him so as to elicit the walking response; four 
brief sessions of exercise were held daily, for a total of ten minutes per day (active- 
exercise group). In a second BrOoup, parents exercised their infant’s limbs for the 
same time per day, by moving the arms and legs with a gentle pumping motion 


on his back; the walking response, however, was not elicited 
(passive-exercise group). The infants in the two 


exercised systematically. These served as control 
testing of the walking reflex and of the regular 
infants during exercise. 


remaining groups were not 
Broups, to assess effects of weekly 
periods of attention given to the 


On the average for all infants, the walking response was elicited at a rate of 
five to ten responses per minute during the first week of life. For all but the active- 
exercise group, this responding dropped to a rate of less than four per minute by 


elicitation of the walking response 


se in the passive-exercise group, and 
ontrol groups. Thus, exercise of the 


Walking among the active-exercise infants seemed to Progress from a reflexive 


to an instrumental Tesponse. There is little doub i red. 
The behavior of one S-weekold i ET 


0 y infant, in particular, is difficult to dispute; 

ন ক Vigorous walking motions while the Observer Suspended him in 

midair over the table... . Not Only were there more responses by the active- 

oR SE but they were better executed. (Zelazo Zelazo, and Kolb, 
pl ’ s 
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response may raise its probability of emission. This formulation differs from the 
classical laws of exercise or practice to which we have related it. With the evidence 
at hand, we cannot say how it will stand up to future research. Nevertheless, such 
4 process is undoubtedly the simplest that could enter into instances of learning, 
because it depends only on the simplest of experimental operations: the presenta- 
tion of stimuli. 


SUMMARY AND EXTENSION 


Up to this point, we have concentrated on how stimuli affect responding. Virtually 
all of the phenomena considered so far can be defined and described in terms of 
response probabilities: in a reflex relation, a stimulus raises the probability of a re- 
Sponse to near 1.0; in reflex inhibition, a stimulus reduces the probability of a 
response; in adjunctive behavior, the elicitation of one response changes the proba- 
bility of some other response; in adaptation, response probability decreases 
Over successive stimulus presentations; in facilitation, response probability increases 
Over successive stimulus presentations; and in the possibly controversial 
Phenomenon of exercise, successive elicitations raise response probability in the 
absence of the stimulus. We now examine an outcome of stimulus presentations, 
imprinting, that must be discussed in terms other than the effects of the stimulus 
On response probability (cf. Lorenz, 1937; Hess, 1973). Imprinting includes some 
effects of stimulus presentation that provide a bridge to the treatment of response 


Consequences in the next chapter. 


Stimulus presentations in imprinting 


rst moving thing it is likely to see is its 
mother, and even on this first day of its life outside the egg the duckling will 
Probably begin to stay close to her. But if the mother is gone and the duckling 
first sees something else in motion, such as a human, the duckling will behave 
toward this stimulus as it otherwise would have toward its mother. Such stimuli 


Are said to be imprinted, i.e.,in a figurative sense, stamped into the duckling. 
rated in both laboratory and field with a variety 


Imprinting has been demonst > { a var 
Of stimuli ranging from real and model birds to electric trains (some stimuli, of 
Course, work better than others). The development of imprinting is sometimes said 


to have a critical period of one or a few days: if imprinting does not occur during 
this critical period, it may not occur at all.“ Whatever the basis for the critical 
Period, the duckling begins to respond in significant ways to a stimulus, whether 
mother duck, human, or some arbitrary moving object, if the stimulus is introduced 
early enough in the duckling’s life. One of these responses is following the 


When a duckling hatches, the fi 


) suggests that this happens because fear of novel 


er, 1973 bl 
as the birds get older, they make characteris- 


tical period; 


Ordinari প i i long enoug! 
5 arily stay near such stimuli lo ie duck’s life if this later effect of novel stimuli could 
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imprinted stimulus as it moves; it has sometimes been said that this following is 
elicited by the imprinted stimulus, but this usage of the language of elicitation 1s 
misleading. 

We might say that the imprinted stimulus, through its presentation to tie 
duckling in appropriate circumstances, has acquired control over the duckling s 
behavior. For example, if the imprinted stimulus is the mother duck, the duckling 
will follow her about and will emit distress calls in her absence. But what kind of 
control does the imprinted stimulus exert over the following response? Whenever the 
duckling walks toward the mother, it finds itself closer to her; whenever the 
duckling walks from the mother, it finds itself farther away. In other words, the 
natural consequence of walking in different directions is to change the distance 
between the duckling and its mother. If closeness to the mother is an important 
consequence for the duckling, it should be no surprise that it walks toward her 
rather than away from her. It also follows that if the duckling’s environment Is 
changed so that the presence of the mother requires some response other than walk- 
ing, that other response should begin to occur in its place. 

Such an experiment was designed by N. Peterson (1960). On one side of a 
window was a dark compartment containing a moving imprinted stimulus. A duck- 
ling, on the other side of the window, could light up the side with the imprinted 
stimulus by making an appropriate response. It did so even when the response was 
one incompatible with following, such as pecking at a disk located on the wall or 
standing still on a platform near the window. In other words, the critical property 
of the imprinted stimulus was not that it could elicit particular responses, such 4s 
following or pecking or standing still, but rather that it had become an important 
consequence for the duckling and therefore could reinforce or increase the 
frequency of such responses as following or pecking or standing still. In the natural 
environment, following will ordinarily keep the duckling close to the imprinted 
stimulus, but a laboratory environment shows that the duckling will learn to make 
other responses if they are the ones having this important consequence. 

In imprinting, the effects of stimulus presentation are not changes in the 
probabilities of responding; rather they are changes in the significance of the 
stimulus that has been presented. The imprinted stimulus acquires its significance 
for an organism simply by its presentation at a particular time in the organism’s 
life. It begins as a stimulus toward which the organism is relatively indifferent but 


ends as an important stimulus, one that will affect the duckling’s responding when 
it is made a consequence of behavior. 


Motivation and the significance of stimuli 


There are other ways to change the significance of stimuli. For example, if 
we knew that a rat was more likely to eat or drink than to run in a running wheel, 
we might expect that the rat would press a lever more often if its lever presses 
produced food or water than if its presses produced only access to the running 
wheel. While continuing to allow the rat free access to food and water, however, 
we could remove the wheel, or lock it so that it could no longer rotate, thereby 
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preventing the rat from running in the wheel. After a period of deprivation of wheel 
running, we might then find that the rat would be more likely to run than to eat 
Or to drink if the opportunity to run in the wheel was available again, and that the 
rat would press the lever more often if presses produced access to the running 
Wheel than if they produced food or water. In other words, by depriving the rat of 
one or another of these stimuli, we can change their significance relative to one 
another: we can make running more likely than eating, or drinking more likely 
than running, or eating more likely than drinking, depending on the rat’s prior 
deprivation of each. 

Changes in the significance of stimuli are the basic concern of the study of 
motivation (e.g., Bolles, 1975). Stimuli may be made more or less reinforcing or 
more or less aversive, depending upon such factors as the time since their last 
Presentation. Food, for example, becomes more reinforcing as time passes since the 
last opportunity to eat, and it may even become aversive after eating if an unusually 
large quantity is consumed. (The significance of stimuli can also be changed 
through means other than deprivation; for example, conditioned reinforcers and 
conditioned aversive stimuli are stimuli that have acquired their reinforcing or 
aversive properties through their relation to other stimuli. For a discussion of the 
interaction between motivation and adaptation effects, see R. L. Solomon and 


Corbit, 1974.) 

We have reviewed many aspects of the sti 
are ready to move on to the consequential opera 
ment. The effects of stimuli may be important to our understanding of conse- 
quences. Whenever responses have consequences, these consequences will also have 

hanges in responding that can be produced 


their stimulus effects. With all of the 
by presenting stimuli, we should not expect to understand consequences without 


taking them into account. 


mulus-presentation operation, and 
tions of reinforcement and punish- 


Consequences 


of Responding: 
Reinforcement 
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Through the Indo-European root, sekw-, to follow, consequence is linked to 


si s ls 4 
5 and designate, from the Latin signunm, something that one follows, and 
ocial and association, from the Latin socius, 2 companion or follower. It 


Fa its prefix, con-, with conditioning, contingency, and contiguity. Condi- 
ning, through the Indo-European root deik-, to Show or pronounce, has 
how or instruct; judg- 


Pen oe teach, from the Old English tEcan, to § 
the 0 rom the Latin judex, he who pronounces the law; and paradigm, from 
cont reek para, beside, and deiknunai, to Show. Contingency, from the Latin 
are ingere, to touch on all sides, to happen, has varied meanings: a possibility; 
cls ondition of being dependent on chance; something incidental to something 
Cc 2 Like contact, it combines the roots com-, together, and tangere, to touch. 
ontiguity, the condition of touching or being in contact, has the same 
Origins. Curiously, contingency and contiguity are usually contrasted in psy- 
chology: contingency, in its technical use, stresses how the likelihood of one 
event may be affected or caused by other events, whereas contiguity simply 
HEE the juxtaposition of events in space or time, without regard to causa- 
on. 


R environment can do things to an organism, but the organism in turn can do 
ithe, to its environment. An organism's behavior can have consequences. A rat 
On food may find food by going to some other location. Merely by moving 
fo. One place to another, the rat changes its environment from one without food 
ne with food. But food as a consequence may also change the rat's future 
avior: the rat may go there again later when deprived of food. It is an important 


ac j 
t of behavior that it can be affected by its consequences. 
havior is not simply Pre 


e relation to behavior. 
will be followed by specified events. 


he natural environment. Even before 
onments by doing things or by going 
er study how consequences affect 
the laboratory. We can build a 
r after making an appropriate 


senting stimuli. Rather, 
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which the pigeon pecks the key. In both cases, responses have consequences and the 
consequences may affect future responding. 

This chapter begins by exploring the historical development of experiments 
on the consequences of behavior. It then treats some properties of the contemporary 
concept of reinforcement: the relativity of reinforcement, the significance of dis- 
continuing reinforcement (extinction), the range of consequences that can be 
reinforcing, and the range of responses that can be reinforced. It concludes by 
showing that reinforcement is relevant to simple sensory-motor interactions like 
the visual consequences of eye movements as well as to behavior maintained by 
physiologically significant consequences such as food or water. 


THE LAW OF EFFECT 


Chapter 2 considered Thorndike’s experiments with animals in problem boxes, 
from which the animals could escape by operating an appropriate device that 
released the door. Typically, the food-deprived animal was placed inside the box 
with food available outside. In its varied activity, the animal sooner or later 
Operated the device that released the door and was then free to leave the box. With 
tepetiiion the animal operated the device more and more rapidly upon being placed 
in the box. The response, at first of low probability, became more and more 
probable as it repeatedly had the consequence of opening the door. Thorndike 
described how the consequences of responding affected later responding in terms of 
a principle he called the Law of Effect. The law went through many revisions, but 
its essence was that responses could be made more probable by some consequences 
and less probable by others. In a language closer to Thorndike’s, responses with 
satisfying effects would be stamped in whereas those with annoying effects would 
be stamped out. ! 

Figure 4-1 illustrates data from one of Thorndike’s cats. To escape from this 
box, the cat had to pull a string that ran from a wire loop at the front of the box 
to a bolt that held the door. The first time in the box, the cat took 160 seconds 
to escape. This time decreased gradually and irregularly over successive trials until 
the cat was reliably escaping in less than 10 seconds during the last few trials. 
This process, the gradual decrease in the time taken to complete a task, came to be 


called trial-and-error learning (Kohler contrasted this gradual change with the 
sudden or insightful solutions he observed with chimpanzees). 


Mazes and learning curves 


In later years, trial-and-error learning was studied with many different 
Organisms in many different kinds of situations. Experimenters believed that the 
intelligence of different species could be compared by how rapidly learning 


!This version Was called the stron, 
second half of the Law, leaving the incre 
decrease in probability or Stamping out. 
Effect. This historical point will be relev 


8 Law of Effect. Later, Thorndike repudiated the 
ase in probability or stamping in but discarding the 
What remained was then called the weak Law of 
ant in chapter 5, when we deal with punishment. 
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End oR OE mazes, runways, and other diverse apparatus (e.g., 
etre সি 4 oodworth and Schlossberg, 1954). Apparatus design began to be 
ন Boron questions: whether learning took place in discrete steps, on 
সা a or instead occurred gradually and continuously; whether 
tinnitus leg Te HONE (response learning) or properties of the environment 
Okonly iad ক whether the consequences of responding led directly to learning, 
tt ্‌ি the Organism perform so as to demonstrate what it had learned in 
Stell rt ome of these issues were the basis for long-lasting controversies (Gold- 
’ Z, and Rains, 1965). 
ils a SE feature of these various experiments was that responding could be 
as EG probable when it had certain consequences. The change in probability 
Giaphs sh red differently depending on the apparatus and the experimental aims. 
leaming Png how behavior changed in the course of an experiment were called 
Gg, Fi le the time to complete a response as 4 function of number of trials 
reachin Bure 4-1); the percentage of correct responses; the proportion of animals 
these i some criterion of successful performance at different stages. Sometimes 
in SUE were transformed to ease comparisons among them. For example, 
Sabo Ssive runs of several rats through a maze, each rat’s time to run from 
X to goalbox decreases while each rat's percentage of correct tums and the 


pro i f 
Portion of rats making errorless runs increases. Converting the time to run 
al of the running time, made it 


thro 
Danie the maze to speed, defined as the reciproc 
ai le to compare three different measures all of which increase with learning. 
the shapes of learning curves depended so much on the apparatuses used and 
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the measures taken that the progress of learning could not be described in any 
unitary way. 

The problem was that the performances in these experiments were compli- 
cated. For example, measuring the time course over which a rat stopped entering 
blind alleys as it learned its Way through a maze did not show how learning 
proceeded at a single choice point. This consideration led to the gradual simplifica- 
tion of mazes, as illustrated in Figure 4-2. Diagram A shows the plan of the earliest 
maze used for the study of animal learning (Small, 1899-1900). The maze was a six- 
by eight-foot modification of the hedge maze at Hampton Court in England. 
(Curiously, such mazes may also have provided the setting for the hydraulically 


choice at 1 was learned more quickly than the choice at 7, the difference might be 
due either to the early and late 


to the different floor- 


’ 


» rather than after a Correct right turn at 3; is it 
of the maze rather than near its beginning or its 


Start 


c 

mazes in the study of animal learning: A. the 
Small (1899-1900); B. a U-maze with six 
T-maze; and D. the runway or straight alley. 


Figure 4-2. Stages in the evolution of 
Hampton Court maze, as adapted by 
choice-points; C. the single-choice-point 


Figure 4-2. Now no errors Were even possible, and the measures of behavior were 
reduced simply to the speed with which the rat moved from startbox to goalbox. 

There were other problems. An average measure of the performance of a 
group of animals did not necessarily represent the performance of individuals in 
the group. Suppose, for example, that for single rats in a simple T-maze frequent 
errors typically change abruptly after several trials to consistently correct turns, but 
this change occurs on different trials for different rats. In a large group of rats, 
65% might make correct turns by the fifth trial, and 72% by the sixth, and 79% 
by the seventh, and 83% by the eighth, and so on, until performance became stable 
at 98% to 100% by the twentieth trial. This group performance would give the 
appearance of a gradual increase in correct turns, and would completely obscure the 
abrupt change in performance in the behavior of individual rats (cf. Sidman, 1952; 


Estes, 1956). 

Even the simple runway was not an ultimate solution, because the speed of 
running down a straight alley could be affected by many trivial factors. If trials 
began with the opening of a startbox door, speed might be affected by the direction 


the animal was facing when the door opened. Speed might also be affected by the 
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experimenter’s handling of the animal when moving it between trials from the 
goalbox to the startbox, or by odor trails left by other animals, or even by the 
room in the goalbox if it forced the animal to slow down so as not to bang its head 
against the goalbox wall. 

With mazes or runways, the experimenter had to return the organism from 
goalbox to startbox to begin a new trial, and was only indirectly recording behavior 
when measuring time taken to get from startbox to goalbox. In such cases, the 
experimenter rather than the organism determined when behavior could occur, and 
the overall time measure was not equivalent to specifying in detail the particular 
Sequence of responses on a given trial. Two experimental innovations helped to 
solve these problems. First, an apparatus was designed in which the organism could 
repeatedly emit an easily specified response without the experimenter’s interven- 
tion. Second, responding was measured directly, by its rate or frequency, rather 
than indirectly by other measures derived either from complex response sequences 
Or from groups of organisms. Inspired partly by an interest in reducing the handling 
of the organism and thereby simplifying the work of the experimenter, these were 
important features of a direction of research initiated by Skinner (1930, 1938, 
1950; see esp. Skinner, 1956, for a history of these developments). 


Experimental chambers and cumulative records 


Two representative apparatuses are illustrated in Figure 4-3: a standard rat 
chamber with a single lever on the left and a three-key pigeon chamber on the 
right. They share response devices (the rat’s lever and the Pigeon’s three keys), 
mechanisms for delivering reinforcers such as food or water (a pellet dispenser for 
the rat and a grain hopper for the pigeon), and stimulus Sources (a speaker and a 
lamp for the rat and lamps or projectors behind each of the keys for the pigeon; 


are ordinarily sufficient. Once feeder training is completed, the apparatus is 
changed so that pellet deliveries depend upon lever presses. Sooner or later the rat 
» this lever press produces a food pellet, and the food pellet 

will then probably £0 back to the lever and press it again. 
for the lever press are considered in chapter 6.) The 


Figure 4-3. A rat chamber (left) and a three-key pigeon chamber (right). The rat 
chamber includes a lever (A), a food cup and pellet delivery tube (B), a speaker (C), 
a lamp or houselight (D), and a grid floor through which electric shock can be 
delivered (E). The pigeon chamber includes three keys (F, G, H) and the opening 
to a food hopper (D. Lamp each key allow colors or patterns 


to be displayed on the keys. 


s or projectors behind 


outcome of interest is the frequency with which the rat presses the lever. If this 
frequency increases, we call the food pellet a reinforcer. In the type of chamber 
shown in Figure 4-3, other kinds of reinforcers could be substituted. For example, 
the pellet dispenser might be replaced by a dipper that delivers small quantities of 


Water or milk. 
A pigeon chamber differs from a rat chamber in that keys are substituted for 
levers and the feeder accommodates the pigeon’s standard diet. The key is simply 
d hole in the chamber wall. One 


a piece of plastic mounted behind a roun! : h J 
component of the key is a switch that allows electrical recording of the pigeon’s 


Pecks if they are forceful enough (the keys are routinely sensitive to forces of less 
than 0.1 Newton, which is about 10 grams or one-third of an ounce; for pigeons’ 
key pecking, this sensitivity is adequate for most purposes). The plastic is usually 
translucent, so that lamps of miniature projectors behind the key can project 
patterns or colors on it. The pigeon chamber in Figure 4-3 contains three keys, 
arranged horizontally about 23 centimeters (9 inches) above the chamber floor. 
Any particular experiment might use only a single one of these keys, some 


combination of two, or all three. Typically, keys are lit when they are in use. 
The opening to the feeder is centered below the keys. The feeder consists of 


a tray of pi d (mixed grain or commercially available pigeon pellets) that 
Hoon ¢ h when operated. Like the delivery of the 


can be brought within the pigeon’s reac 
0 n feeder is accompanied by characteristic 


rat’s pellet, the operation of the pigeo 
Sounds; it is also common practice to light the feeder and turn off all other chamber 


lights whenever the feeder is operated. The chamber typically includes other 
features, such as a houselight for dim general illumination, sources of masking noise 
and of other auditory stimuli, ventilation and temperature control, and so on. 

As with the rat, if a food-deprived pigeon’s pecks on a key produce food, we 
may expect an increase in the rate at which the pigeon pecks that key. In these 
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apparatuses, the opportunity to eat is used to raise the probability of a response. 
Whether the response is a rat’s lever press or a pigeon’s key peck, the important 
common property of these arrangements is that responses can have consequences 
and that these consequences can change behavior. The rat and the pigeon are 
common laboratory organisms. They each have idiosyncratic species-specific 
behavior patterns that must be taken into account, and we must not take it for 
granted that what we observe with rats or pigeons can be generalized to other 
organisms. Nevertheless, their diet, housing, susceptibility to disease, and other 
characteristics are reasonably well understood, and their small size, relatively 
long life-span, and economy make them particularly convenient. Thus, we will 
find that they have often served in experiments on the consequences of responding. 

The responding in apparatuses like those in Figure 4-3 has sometimes been 
called free-operant responding: free because the organism is free to emit the 
response at any time rather than waiting for the experimenter (as when the rat in 
a maze goalbox must wait to be placed back in the startbox before it can run 
through the maze again); and operant because the response operates on the environ- 
ment. Free-operant responding lends itself to a recording method, the cumulative 
record, that provides a detailed and convenient picture of how responding changed 
over time. In a cumulative recorder, illustrated in Figure 4-4, a roll of paper is 


Roller Pen Drive Pen Direction of 
Pen Movement 


Direction of 
Paper Movement 


Cumulative 
Record 


Figure 4-4. Principal components of a cumulative recorder. The roller drives the 
paper at a constant speed, and each response moves the pen a fixed distance across 
the paper. The paper speed and the step-size for each response vary with the 
behavior under study. A common scale is about 1 centimeter per minute (about 2.5 
minutes per inch) and 1100 responses across the width of the paper (about 200 
responses per inch). At this scale, a slope of 45° represents a rate of about 40 


responses per minute. When the pen moves to the top of the paper, it resets auto- 
matically to its starting position near the bottom. 
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A small section of record C that includes a few responses is magnified in 
record D, in which the magnified cumulative record is accompanied by an event 
record on the same time-scale. For each response in the event record, a small step 
Occurs in the cumulative record; this property of cumulative records is not usually 
Obvious because typical response and time scales are too small for such fine resolu- 
tion of detail. Nevertheless, different patterns of responding can be distinguished 
in cumulative records. For example, the rates of responding in records E and F 
are roughly the same but record E is steplike whereas record Fis relatively smooth. 
This means that the responding that produced record E occurred in short high-rate 
bursts (Steep segments) separated by pauses (flat segments), whereas the responding 
that produced record F occurred at a much more uniform rate from moment to 
moment. This property of cumulative records is sometimes called grain; of the two 
records, E has the rougher grain and F has the smoother one. 

Records G and H provide other examples of the detailed properties of 
behavior made visible in cumulative records. In record G, the rate begins at roughly 
25 responses per minute, but as time passes it gradually decreases; in record H, 
the rate changes in the opposite direction, increasing from a relatively low rate to 
roughly 30 responses per minute (such patterns of responding are respectively 
called negatively and positively accelerated, because the slopes of the records 
Tespectively decrease and increase over time). 

Figure 4-6 shows additional features sometimes incorporated into cumulative 


records. Records A and B show how displacements of the response pen can be used 
to indicate other events besides res 


Produced food, irre 
The repeated conc 
called scalloping. I 
was followed by s 


ponses. In these records, only some responses 
gularly in A (as at a, b and C), and regularly in B (as at d and ¢). 
Ave pattern in B, as illustrated between d and e, is sometimes 
n record C, responding that began at f and continued through 8 
Ome event (e.g., food delivery) indicated by the pen displacement 
at §. The pen then reset to h and the sequence repeated, as at h to i and so on. This 
type of record makes successive Segments easy to compare (e.g., mors 
Occurred in the segment ending at g than in the one ending at i). Record D 
illustrates how sustained pen displacements can distinguish different conditions. 
For example, in the Presence of a tone the pen may be in its normal position, as in 
segments J, 1, and n, whereas in the absence of the tone, the pen may be displaced 
downward, as in segments k, m, and 0. 

With this treatment of free-operant behavior and cumulative records, we have 
red part of the technological development of the science of behavior. Before 
we can effectively consider the findings made available through such analyses, we 
must turn our attention to some aspects of the language of behavior. 
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When a food-deprived rat’s lever Presses produce food, lever pressing becomes more 


S do not produce food. When a water-deprived 
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g Successive segments of the record (f to 
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responding in the presence of one stimulus (at j, l, and n) from nonresponding in 
the presence of another stimulus (at k, mM, and 0). In this figure, a slope of 45° 
represents a rate of about 40 responses per minute (note that this scale differs from 
that in Figure 4-5). 
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Figure 4-6. Additional features of ¢ 


Parent’s attention, crying becom e than when it does not produce 
attention. These cases illustrate the principle of reinforcement: when responding 
Produces reinforcers, responding increases. The principle is simple, but as it evolved 
from Thorndike’s initial versions of the Law of Effect to its contemporary status it 
carried problems of language and logic with it (Catania, 19690, 19736). Some 
Properties of the contemporary vocabulary of reinforcement are summarized in 
Table 4-1. 

The vocabulary of reinforcement i 


es more probabl 


ncludes the term reinforcer as stimulus and 


the terms reinforce and reinforcement 4s either operation or process. For example, 
When the rat’s lever presses produce food pellets and lever pressing increases, we 
may say that the pellets are reinforcers and that the lever presses are reinforced 
with pellets. Although a reinforcer is a kind of stimulus, reinforcement is neither 
Stimulus nor response. Instead, reinforcement as an operation is presenting a 
reinforcer when a response occurs. This operation is carried out on responses, and 
therefore we speak of reinforcing responses rather than of reinforcing organisms. 
We may say that a food pellet reinforced a rat’s lever press or that a pigeon’s key 
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TABLE 4-1 


The Vocabulary of Reinforcement.! This vocabulary is appropriate if and only if 
three conditions exist: (1) a response produces some consequence; (2) the response 
occurs more often than when it does not produce that consequence; and (3) the 


increased responding occurs because the response has that consequence. 


Term 


Restrictions 


Examples 


reinforcer (noun) 


reinforcing (adjective) 


reinforcement (noun) 


to reinforce (verb) 


A stimulus. 


A property of a stimulus. 


As an operation, the 
delivery of a consequence 
When a response occurs. 


As a process, the 
increase in responding 
that results from the 
reinforcement operation. 


As an Operation, to 
deliver a Consequence 
When a response occurs; 
Tesponses are reinforced 
and not organisms. 


As a process, to increase 
responding through the 
reinforcement Operation. 


Food pellets were used as 
reinforcers for the rat’s lever 
presses. 


The reinforcing stimulus was 
produced more often than 
the other, nonreinforcing 
stimuli. 


The fixed-ratio schedule of 
reinforcement arranged food 
deliveries after every tenth 
key peck. 


The experiment with 

monkeys demonstrated 
reinforcement produced 
by social consequences. 


When a period of free play 
Was used to reinforce the 
child’s completion of class 
assignments, the child’s 
grades improved. 


The experiment was designed 
to find out whether gold 
Stars would reinforce 
cooperative play among 
first-graders. 


1 parallel vocabulary i 
punish as a verb; cf. ch 


S appropriate to 
apter 5), with the 
Sponding occur less rather than more often. 


punishment (including punisher as a stimulus and 
difference that a punishing consequence makes re- 
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that a response was reinforced can be interpreted in two ways: either the response 
Produced a reinforcer (operation) or responding increased because it produced a 
reinforcer (process). This text favors the usage of reinforcement as an operation. 
The process that follows from this operation can be described so easily in terms of 
changes in responding (e.g., response probability increased) that there seems little 
Justification for substituting other terms for a direct description of what happens 
to responding. Nevertheless, the process usage has so much precedent that we will 
be unable to avoid it completely. In addition, the overlap between operation and 
Process vocabularies extends to a variety of other terms in learning (e.g., extinc- 
tion, punishment, conditioning; cf. Ferster and Skinner, 1957; Catania, 1968). 

The vocabulary of reinforcement leads to some logical difficulties even when 
restricted to experimental operations. When a response becomes more probable 
because it has produced a stimulus, we say the response has been reinforced and we 
Call the stimulus a reinforcer. If asked how we know that the stimulus was a 
reinforcer, we can point to the increase in responding. If then asked why the 
increase occurred, we may say it did so because the response was reinforced. At 
Some point, we begin to repeat ourselves. Once we define a reinforcer by its effect 
On behavior, we create a problem of circular definition if we simultaneously define 
the effect by the reinforcing stimulus (cf. Meehl, 1950). K চ 

One solution is to recognize that the term reinforcement is descriptive rather 
than explanatory. It names a particular relation between responses and the environ- 
ment, but it does not explain that relation. The relation is complex and must 
include at least three components. First, a response must have some consequence. 
Second, the response must increase in probability (i.e., the response must be more 
Probab]e than when it does not have this consequence). Third, the increase in 
Probability must occur because the response has this consequence, and not for 
Some other reason. For example, if we knew only that a response had become more 
Probable, it would not be appropriate to SAY that the response must have been 
reinforced; the response might have been elicited by a stimulus. It would not even 
be sufficient that the response was now producing some stimulus it had not been 
Producing before. We would still have to know whether responding increased be- 
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Apostate olerperiment witha newborn sibling by pinching the 
infant whenever it cries. At first the infant might lie quietly, but if the older child is 
Patient the infant will sooner or later Cry. Then comes the pinch, but the pinch will 
be followed by more crying, and thus by more pinches, and thus by more arying, 
and so on. We assume that this procedure will inevitably attract the parents’ atten- 
tion and that they will intervene to rescue the infant. At that point the pinching 
may have consequences for the older child, but we must leave the scene before the 
Parents arrive to get back to the language of reinforcement. Ry: 

In this example, a response had a consequence and responding increased. Two 
Of our criteria for reinforcement were therefore satisfied. Yet we would be 
reluctant to call the pinches reinforcers or to say that the child had reinforced the 
infant’s crying with pinches. Our reason is that we do not believe that the increase 
in crying came about because of the consequential relation between crying and 
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pinches. We know enough about infants and crying and pinches to recognize that 
the pinches would have made the infant cry whether or not the pinches were pro- 
duced by crying; the pinches by themselves would have made the infant cry even if 
they had been delivered without reference to crying. Our conclusion is that crying 
Occurred because it is ordinarily elicited by pinches and not because it produced 
pinches. It was incidental that the first cry happened to lead to the first pinch; the 
infant cried when pinched whether or not it brought on the pinches itself. When- 
ever responses produce stimuli as consequences, the stimuli may have eliciting 
effects along with or instead of their effects as consequences of responding. 

The vocabulary of reinforcement requires that the response have a conse- 
quence, that responding increase, and also that the increase Occur because respond- 
ing has its consequences and not for other reasons. Once these conditions are satisfied, 
we may describe the circumstances by saying that the response was reinforced and 
that the stimulus was a reinforcer. We may also assume that the stimulus will continue 
to be effective as a reinforcer in the future, and that it will reinforce other 
responses in other situations. Either assumption, however, may be incorrect. The 
effectiveness of reinforcers can change over time, and some stimuli may reinforce 
Some responses but not others. For example, in everyday conversation saying 
“you're welcome” may reinforce someone else’s “thank you,” but money rather 
than “youre welcome” is more likely to reinforce other kinds of behavior, such as 
the services of a plumber or an electrician; by itself even “thank you” might not be 
effective. Despite these reservations, the reinforcers used in many standard experi- 
mental situations (e.g., food with food-deprived organisms) are effective reinforcers 
for a variety of responses; the experimenter who chooses a stimulus that will 


reinforce some responses but not others will sooner or later confront the difference 
and have to cope with it. 


The relativity of rein forcement 


The preceding discussion buttresses the logic of the vocabulary of reinforce- 
ment, but does not tell us how to identify reinforcers independently of their effects 
in the consequential Operation of reinforcement. Without making a stimulus a 
consequence of responding we cannot Say whether or not the stimulus will be 
effective as a reinforcer. Even demo! 
ith food and water continuously 
available to a rat, lever Pressing might not be much affected by making food or 
Water a consequence of lever presses. (In chapter 3, these effects were discussed in 


Consequences of Responding: Reinforcement 77 


effects on behavior, but we may find it useful to examine behavioral effects of 
reinforcers, such as the responding they occasion. 

When a rat’s lever press produces food, the food gives the rat an opportunity 
to eat. If we make both lever and food available to the rat at the same time the rat 
is more likely to eat than to press the lever. This kind of observation led to the 
hypothesis that the probability of one response will increase if that response gives 
the organism an opportunity to engage in another response more probable than the 
first (Premack, 1959, 1965, 1971). In other words, if response A is more probable 
than response B, an opportunity to engage in response A can be used to reinforce 
response B. According to this account, food is an effective reinforcer for a food- 
deprived rat’s lever presses simply because eating is more probable than lever 
Pressing. 

Consider an experiment that demonstrated how the reinforcing effects of two 
Stimuli could be reversed by independently varying the probabilities of the 
responses occasioned by these stimuli (Premack, 1962). A rat’s running in a wheel 
Was controlled by engaging or releasing a brake on the wheel, and was measured in 
number of revolutions. The rat’s drinking from a drinking tube was controlled by 
moving the tube into or out of an opening in a stationary wall on one side of the 
Wheel, and was measured with a drinkometer, an electrical system that recorded 
licks. As tested during brief periods when both responses were available, running 
became more probable than drinking when the rat’s opportunity to run was 
restricted while water remained available, but drinking became more probable than 
Tunning when the rat’s opportunity to drink was restricted while the rat continued 
to have access to the wheel. In both cases, the opportunity to engage in the more 
Probable response was a consequence that could be used to reinforce the less 
Probable response. When running was more probable than drinking (after restricted 
Access to the running wheel), licks became more likely if they released the brake on 
the wheel and allowed the rat to run than if they had no effect on the opportunity 
to run. Conversely, when drinking was more probable than running a EA 
Access to the drinking tube), running became more likely if it produced the rinking 
tube and allowed the rat to drink than if it had no effect on the opportunity to 
drink, 

This demonstration implies that reinforcers cannot be defined independently 
Of the responses that they reinforce. In Premack’s experiment, drinking reinforced 
tunning when drinking was more probable than running, but running reinforced 
drinking when the probabilities were reversed. According to this account, 
Teinforcers are relative and not absolute. Their important properties are based on 
the responses for which they provide an opportunity. This relativity went unrecog- 
nized for a substantial time in the psychology of learning. Most learning 
experiments restricted attention to responses with relatively low probability (e.g., 
the rat’s lever press) and to reinforcers that occasioned highly probable responses 
€.8., food and eating). These cases Were common and convenient but were special 
Cases nonetheless. No one thought to ask, for example, whether by appropriate 
deprivation conditions a situation might be arranged in which an opportunity to 
Press a lever could be used to reinforce eating. (The example is not so far-fetched. 
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The opportunity to eat a good meal can be an effective reinforcer, but consider 
how often children are persuaded to finish their dinners by making other activities 
depend on that eating.) 

The relativity of reinforcement may be illustrated further by expanding the 
previous experiment to three responses. Let us add a pellet feeder to the activity 
wheel and the drinking tube and by appropriate deprivation operations make eating 
more probable than running and running in turn more probable than drinking. In 
these circumstances, running is reinforced by the opportunity to eat, but the 
opportunity to run reinforces drinking. In this situation, a response can 
simultaneously reinforce and be reinforced. By changing the relative probabilities 
of the three responses, we can make an opportunity to engage in any one an 
effective reinforcer with respect to either or both of the other two.! 

Earlier, we spoke of behavior in terms of hierarchies of responses. The 
ranking of responses within this hierarchy can be altered by stimuli, by the 
Organism’s opportunities to engage in different responses, and even simply by the 
Passage of time. When we restrict an organism’s Opportunities to engage in some 
response (i.e., when we arrange deprivation), we make that response more probable 
Or, in other words, we raise it within the hierarchy. We then may raise the 
probability of other responses lower in the hierarchy by using the opportunity to 
engage in the first response as a reinforcer. 

We introduced reinforcers as kinds of stimuli, but we are now speaking of 
reinforcers in terms of responses. The treatment shifted because we found that an 


important property of a reinforcer was the responding that it occasioned. Premack 
discussed this as follows: 


I have been accused of ignoring the stimulus; in fact, all the response 
probabilities we have measured have involved explicit stimulus operations, 
€.g., pellets, sucrose solutions, pinball machines, manipulanda, activity 
wheels, etc. ... Even though it is desirable to take the position that 4 
response 1s a means of calibrating the value a subject sets on a stimulus, an 
that the precise contribution which the response factors make to value is 2 
matter for research, we should not Swing so far in the other direction as to 
return to the traditional stimulus error. For example, tradition talks about 
the reinforcement value of food. But food has no value; we must talk about 
the value of food when it can be smelled (but not seen), seen (but not 


smelled), both seen and smelled, eaten, etc. It is part of the advantage of the 


Do LES AE to avoid this kind of error automatically. (Premack, 1971, 
PP. হং 


Like the reflex, reinforcement is a relation. The relation includes responding, its 
consequences, and the change in behavior that follows. We need not treat reinforce- 
ment as a theory or a hypothesis; it is Simply a name for this relation. 


1 ৰ 
The detailed operation of Premack’s princi iliti al 
j B ple depends on how probabilities are calc 
lated. Choice among simultaneously available Tesponses seems more Cy than the 
proportion of time occupied by each response (but cf. Dunham 1977) 
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Figure 4-7. Two hypothetical cumulative records of the extinction of a rat’s lever 
presses after food reinforcement. Either A or B might be said to demonstrate the 
Ereater resistance to extinction, depending on whether it is measured in terms of 


the time taken until two minutes elapse without a response or in terms of the total 
Tesponses emitted during the extinction session. 


if we wear a watch we often turn Our wrist to look at it; the consequence of looking 
is finding out the time. But if we Stop wearing the watch for some reason, we 
eventually stop looking; the consequence of seeing a bare wrist is not effective as 
a reinforcer. 

The history of the concept of extinction, however, was not so simple. It came 
to be argued that extinction did not merely allow the passive decline of responding 
but rather that it somehow actively suppressed responding. Extinction was said to 
have inhibitory effects. The view that some process actively reduces responding 
during extinction long dominated the alternative view that responding decreases 
during extinction because it is no longer maintained by reinforcement. Thus, texts 
On learning tended to devote Separate chapters to reinforcement and extinction 
rather than treating them as two aspects of the same phenomenon. 

This treatment of extinction goes back to the inhibitory language established 
in the context of Pavlov’s conditioning experiments (cf. chapter 2; see also Skinner, 
1938, pp. 96-102). That language, developed to deal with combinations of stimulus- 
presentation and stimulus-control Operations, was carried over to consequential 
Operations. Once that had happened, the language was retained because it seemed 
consistent with effects during extinction other than the defining decrease in 
responding. 

Consider the phenomenon called Spontaneous recovery (see also chapter 9). 
In a typical extinction Session, responding begins at a relatively high rate that 
decreases as the session continues. But if the organism is then removed from the 
Situation for some time, the rate at which it begins responding in the next extinc- 
tion session is usually higher than the rate at the end of the last session. Some 
hypothetical cumulative records illustrating spontaneous recovery are shown in 
Figure 4-8. Responding at the beginning of each session is Said to have recovered 
Spontaneously from inhibition built up by the end of the previous session; 
presumably this inhibition builds up within sessions, when it actively suppresses 
responding, and dissipates during the times between extinction sessions. 
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Phenomena such as spontaneous recovery were taken to mean that responding 

that had been reduced by extinction was somehow “‘there all the time but 
inhibited.” But accounts that assumed responding was actively suppressed in 
extinction were criticized on the grounds that they explained extinction in terms of 
unobservable events or processes (R. L. Reid, 1958). For example, when it was said 
that a response had been inhibited in extinction, the inhibited response could be 
measured but not any events or processes that were doing the inhibiting. The 
language of inhibition was not particularly useful if the inhibitors could not be 
Specified. Perhaps for this very reason a variety of competing accounts of extinction 
Were formulated in terms of such inferred processes as inhibition, frustration, 
interference, and fatigue (cf. Kimble, 1961, chapter 10). These accounts agreed that 
responding did not merely decline passively in extinction but rather was actively 
Suppressed; they differed only in how they characterized the events that generated 
the inhibition. 
Hl Other accounts, however, did not assum 
in extinction. For example, one account © 
stimulus properties of the experimental setting may differ for the organism from 
the beginning to later in the session, perhaps because of lasting effects of handling 
and other pre-experimental conditions. Thus, the effects of extinction late in one 
Session might not transfer to the different and perhaps novel conditions at the 
beginning of the next session. On this basis, Kendall (1965) reasoned that the usual 
Pattern of response rates in sessions of extinction should be reversible under proper 
Conditions. In an experiment with three pigeons, he first reinforced key pecks in 
One-hour sessions. Then he arranged repeated one-minute sessions of extinction. 
Only after responding had reliably decreased to zero in these brief sessions did he 
Introduce the first long extinction session. Within a few minutes, each pigeon began 
to respond. Until this session, responding had never been permitted to extinguish at 
times later than one minute into a session; responding occurred at these later times 
When the opportunity was finally available. In a sense, Kendall had demonstrated 
Spontaneous recovery within rather than at the beginning of a Session. 

Furthermore, the temporal pattern of responding in spontaneous recovery 
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(Figure 4-8) also occurs with responding that has never been explicitly reinforced. 
With rats newly placed in a chamber with a lever, some lever pressing usually occurs 
even without food reinforcement; the rate of this baseline or operant-level respond- 
ing tends to decrease within sessions and to recover between sessions (Schoenfeld, 
Antonitis, and Bersh, 1950). Thus, an inhibitory theory would have to argue that 
the similar effects are only coincidental or to invent some process that could deal 
with why both types of responding change similarly over time. 0 

Contingencies and stimulus deliveries. Nevertheless, accounts of extinction 
remain controversial. This may be because discontinuing reinforcement has two 
effects: a contingency between responses and reinforcers ends, and reinforcers are 
no longer delivered. The term contingency is simply a name for the consequences 
of responding; it is defined as the effect of a response on the probability of a 
stimulus. For example, if a rat can receive food pellets only by pressing a lever and 
each lever press produces a pellet, the contingency is one in which the lever press 
raises the probability of pellet deliveries from zero to 1.0 (no pellet if no lever press 
and pellet if lever press). If the lever press does not produce pellets, the contingency 
is one in which pellet deliveries are independent of lever presses (the lever press 
has no effect on the probability of pellet deliveries). 

Let us now compare procedures in terms of changes in contingencies and 
changes in stimuli. Consider first the food-deprived rat whose lever presses are 
reinforced with food pellets. Once every ten or fifteen seconds the rat presses the 
lever and eats the pellet delivered. If lever pressing is then extinguished, lever 
presses no longer produce pellets and the rat no longer eats. As a result, the rat 
presses the lever less often. This rat is now different in two ways: not only are its 
lever presses no longer having their earlier consequences but this rat is no longer 
eating. 

Here is an alternative arrangement. The r: 
we discontinue the reinforcement of l 
matically every ten or fi 
the previous example: 1 


at remains food-deprived, but when 
ever presses we begin to deliver pellets auto- 
fteen seconds. In this Case, we change contingencies as in 


ever presses that once raised the likelihood of a pellet 
delivery now have no effect. But even though lever presses no longer have their 


earlier consequences, this rat can still eat a food pellet every ten or fifteen seconds. 
This rat too will press the lever less often; on the other hand, it will differ from the 
first rat in some important ways. 

These two procedures distinguish the effects of terminating a reinforcement 
contingency from those of terminating reinforcer deliveries. The standard extinc- 
tion procedure terminates both a contingency and stimulus presentations; in the 
last example, however, the contingency was terminated without also terminating 
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in the chamber. If pellets were produced by lever presses, the rat might bite the 
lever (Mowrer and Jones, 1943); other organisms in the chamber might be attacked 
(Azrin, Hutchinson, and Hake, 1966); and once the procedure makes such 
aggressive responses probable, the opportunity to engage in them may reinforce 
other responses (e.g., the organism might pull a chain if chain pulls produced some- 
thing it could sink its teeth into: Azrin, Hutchinson, and McLaughlin, 1965). These 
effects may be observed in extinction, but are not direct results of terminating the 
teinforcement contingency. They are side effects usually superimposed upon the 
decrease in responding during extinction because extinction is necessariily 
Accompanied by the termination of reinforcer deliveries. 

We have compared extinction, in which both a reinforcement contingency 
and reinforcer deliveries are terminated, with a procedure in which only the 
reinforcement contingency is terminated. Terminating reinforcement deliveries 
While maintaining the reinforcement contingency does not seem logically possible. 
If instead response-independent reinforcer deliveries were terminated, we might 
expect the same kinds of emotional or aggressive TeSponses 85 47 often produced 
by extinction; this finding would add to the argument that such changes in behavior 
Are side effects of changes in contingencies. 

In the analysis of behavior, operations often have more than one effect. A 
Stimulus that is a reinforcer in a given procedure may affect other responses besides 
the one that it reinforces: it may elicit some responses and serve as a discriminative 
Stimulus for still others. In the history of research on extinction, the various effects 
Of terminating reinforcer deliveries may have contributed to its treatment in terms 
Of inhibition. Yet the various phenomena seer as indicating that extinction was 
More than simply the temporary effect of reinforcement Were probably only side 
effects. Many of these phenomena could have been observed in situations that 
did not involve the consequences of responding (e.8., the aggressive responding 
Benerated by terminating reinforcer deliveries). To that extent they are peripheral 


to an account of the effects of contingencies. _ ff 
Extinction and superstition. In extinction, the effects of changes in contin- 


Bencies are confounded with the effects of terminating reinforcer deliveries, because 
the general effects of terminating reinforcer deliveries are superimposed on the 
more specific effects of terminating contingencies. Why then has extinction for 50 
long been the primary basis for studying the effects of terminating contingencies? 
Part of the answer is that disconnecting the lever from the equipment that operates 
the feeder is more convenient than disconnecting the lever and at the same time 
Substituting a clock that will operate the feeder periodically. Yet it is not likely that 
this change in apparatus by itself discouraged further research. Instead, the difficul- 
ties rested with some additional properties of behavior. 

As we saw in chapter 3, the presentation of stimuli can have many effects. 
A stimulus presented independently of behavior may follow as well as be followed 
by responses. For example, suppose that the response-independent delivery of a 
food pellet occurs just after the rat has pressed the lever. What could distinguish 
this succession of lever press and pellet from that in which the lever press actually 
Produces the pellet? Why should this accidental succession of response and rein- 
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forcer not affect the rat’s future responding just as much as if it were the outcome 
of a reinforcement operation? 

In chapter 3, we discussed superstition (Skinner, 1948). Food was presented 
to a food-deprived pigeon at fifteen-second intervals and responses that occurred 
Just before food deliveries were likely to be repeated (and therefore followed 
closely by still more food deliveries). Through the accidental succession of 
responses and reinforcers, stereotyped patterns of responding tended to develop. 
The responses were different for different pigeons (variability) and they usually 
changed gradually in form over successive food deliveries (topographical drift). 
A particular response is not likely to be superstitiously maintained over an 
extended period of time through the accidental correlation of responses and 
reinforcers, but the possible effects of such accidental correlations must be 
considered in any experiment involving response-independent reinforcer deliveries. 

The maintenance of superstitious responding will depend to some extent on 
the temporal spacing of reinforcer deliveries. The Optimal spacing is one in which a 
response made more probable by one reinforcer will be likely to occur again just 
before the next. Eliciting effects of the reinforcer may also contribute, because the 
reinforcer may directly generate responses between reinforcer deliveries (e.g., as 
When pigeons’ pecks are made more likely by presentations of grain: cf. chapter 3; 
Staddon and Simmelhag, 1971; see also Killeen, 1975). 

Superstitious responding generated by an accidental succession of responses 
and reinforcers is a recurrent problem in the analysis of behavior, because such 
accidental sequences can occur whether reinforcers are independent of or 
consequences of responses. For example, if one response is followed by a different 
response that is reinforced, the reinforcer may affect both responses even though its 
delivery depended only on the second response. Even when a response does have 


While remaining alert to the circum- 
applied. 


probably decline only slowly, not necessarily because the 

contingency has enduring effects but rather because the accidental ion of 
responses and reinforcers may counteract the effects of terminati eG 
ment contingency. Eventually lever Pressing will be replaced b 0 € rein 
(cf. chapter 8), but it would be difficult to argue that such ae 7 eves 
Pressing is simply determined. For this reason, the transition from fren forcement 


reinforcement 
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Ccontingenc i 
a ne no response-independent reinforcer deliveries by itself cannot demo 
ects of terminating the reinforcement contingency (cf. Rescorla 


Skucy, 1969; Boakes, 1973). 


When 
responses have consequences, the termination of these consequences 


affe iui 
ETA stimuli are related to behavior and how stimuli are distributed in 
ee ct must be taken into account in an analysis of behavior. The most 
lenporary Ta of extinction is its demonstration that reinforcement has 
Hon End a than permanent effects, but other events also accompany extinc- 
ie afro e superimposed on the decrease in previously reinforced responding. 
ER) to conceive of reinforcement without extinction; the effects of 
ion tell us that organisms are sensitive to the consequences of their behavior 


not o 
nly when these consequences start but also when they Stop. 


HE ACQUISITION OF BEHAVIOR 


xamining how an organism may acquire 


Le 
t us return now to learning, by € 
d A of Figure 4-9 shows a hypothetical 


Ies 
লা through reinforcement. Recor 
ulative record of a rat’s very first session Of reinforced lever pressing. The first 
pauses. After a little more than five minutes 


৭" tne are separated by long Ps £ 
je লা pauses disappear and responding then increases for the rest of the session. 
quisition of lever pressing appears gradual. If we wished to repeat these 
til responding decreased to 


Obse 4 

BToUe Te we might extinguish lever presses un 
1 5 a 
ous levels and then conduct another session of reinforcement. Record B of 


4 4.9 shows what this performance might look like. At the beginning of the 
oc because of the prior extinction, no responding occurs. Finally a response 
00 ogo (and is reinforced), and responding immediately rises to a rate approxi- 
instanc equal to the rate at the end of the first reinforcement SesSi6n;, In TS 
ow e, we observe an abrupt ra gradual acquisition of lever pressing. 
Shall we reconcile these two ver 


ther than a 
y different performances? 


ACQUISITION AND REACOUISITION 


+ Yh f 


10 Minutes 


100 Responses 


Fi Ged 
Des 4-9. Hypothetical cumulative records of a rat s initial acquisition of lever 
and ing during the first session in which each response 1s reinforced with food (4) 
Teacquisition of lever pressing when reinforcement is introduced again after a 


Deri 
Tiod of extinction (B). 
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Sidman (1960) has discussed an alternative to the view that this depends on 
an irreversible change produced by initial learning: 


The animal learned not only those responses that succeeded in depressing the 
lever but also learned to go to the tray, to pick up the small pellet, to bring it 
to its mouth, etc. And these responses were learned in their correct sequence, 
because their reinforcement was correlated with the appropriate stimuli both 
from the environment and from the preceding behavior. The tray approach, 
for example, could be reinforced only after the sound from the food 
magazine; reaching for the pellet could be reinforced only after the pellet 
had dropped into the tray, etc. ... What did we extinguish when we discon- 
nected the feeding mechanism? ... There is no magazine sound, no pellet 
sound, no visual pellet, no tactual pellet, etc. Tray-approach is still possible, 
but only in the absence of some of its controlling stimuli. The responses 
involved in picking up and ingesting the pellet can no longer occur in their 
Originally learned context. While our extinction procedure may have returned 
the lever-pressing response to its preconditioning level, other components 


of the total learned sequence could not have undergone complete extinction. 
(Sidman, 1960, pp. 101-103) 


From this analysis, Sidman could summarize the reasons for the difference in 
acquisition in the two reinforcement sessions as follows: “When reinforcement was 
again introduced..., the animal did not have to relearn the whole sequence, 
because the whole sequence had not been extinguished” (Sidman, 1960, p. 103). 

Reinforcement, then, does not produce learning; it produces behavior. When 
We look to see whether a rat presses a lever when a reinforcement contingency 
Operates and does not press the lever otherwise, we are simply concerned with the 
extent to which the rat has learned the consequences of its lever pressing. The 
consequences of responding are critical to our understanding of learning not 
because learning follows from them but because they are what is learned. Contin- 
Sencies describe how the environment is affected by behavior and are therefore 
important features of the environment for Organisms to learn. 


Latent learning. The preceding issues were implicit in a controversy based 
upon a phenomenon called latent le 


arning (Blodgett, 1 ; Thi i ৰ 
Consider the experiment illustrated in Bo do To ee Fe ‘oa 
see also Tolman, 1948). Food-deprived rats in each of three groups negotiated a 
maze. In one group, the rats found food in the goalbox of the maze, and over 
Successive daily trials their entries into blind alleys gradually decreased Ina 
second group, the rats found no food in the goalbox. For these animals entries 
into blind alle j 


J YS decreased but remained Substantially hi he 
first group. In a third group, food was intr hai weird 


sessions. The performance of th 
(no food), 


Tolman & Honzik (1930) 


Mean Blind-Alley Entries 


Days 
tent learning. Rats were allowed one trial per day 


Figure 4-10. An experiment in la 
in a fourteen-choice-point maze. 


the goalbox. A second group (unfilled triangles) never did. The third group found 


no food in the goalbox through Day 10 (unfilled circles), but thereafter found food 
(filled circles). This group, which had been performing like the second group, 
quickly became like the first group. The argument was that the rats in this group 
had been learning the maze pattern all along, and that food in the goalbox was 
Necessary only to get them to demonstrate what they had already learned. (From 
Tolman and Honzik, 1930) 

hus learning could not be attributed to food as a 
n extended to reinforcers in general, and experi- 
aid to demonstrate that learning could occur 


Without reinforcement. But soon came the counter-argument that food in the 
Soalbox is not the only possible reinforcer for a rat’s running of a maze. The rat’s 
removal from the maze at the end of a run, its escape from the constrained spaces 
in blind alleys, or its return to the home cage where it has been fed all might func- 
tion as effective reinforcers. Experiments were then performed that varied the rat’s 
handling at the end of the run, the width of the maze alleys, and the rat’s home- 
Cage feeding. Each time one experiment showed that a particular reinforcer could 
Benerate maze learning, some other experiment demonstrated latent leaning in 
Such a way that the reinforcer at issue could not have been effective. And so it 


Went. 


food was in the goalbox, and t 
reinforcer. The argument was the 
ments on latent learning were $ 


gument could not be resolved and latent learning 


Even in principle the ar! 
l experimental issue. The reason was that a rat’s 


Bradually faded away as a critica 
87 


88 Consequences of Responding: Reinforcement 
negotiation of a maze inevitably involves the consequences of responding. At any 
choice point, one turn is followed by an entry into a blind alley and the other by 
an opportunity to move further through the maze; at the final choice point, only 
one turn is followed by entry into the goalbox, whether or not the goalbox 
contains food. The rat’s sniffing, touching, looking, and moving through the maze 
are consequential responses, even if this behavior is not as easily accessible to us as 
correct turns or entries into blind alleys. These consequences are what the rat 
learns, and whether to call them reinforcers is perhaps mostly a matter of 
preference. 

Sensory-motor learning. The consequences so far considered have mainly 
been major classes of stimuli, typically with biological significance for the 
Organism’s survival (e.g., food and water). But many presumably lesser 
consequences are important in our day-to-day interactions with our environment. 
When we open a book, a consequence is that we can read the text. When we listen 
to someone speaking, a consequence is that we hear what is said. When we reach 
out for a nearby object, a consequence is that we touch it. Each consequence may 
then set the occasion for new responses. When we finish one page of a book, we 
may turn the page and read the next; when the speaker has finished, we may ask a 
question or make a comment; when we touch the object, we may pick it up and do 
things with it. To the extent that behavior in each of these Circumstances continues 
because of its consequences, these cases may be discussed in the vocabulary of 
reinforcement (cf. Skinner, 1953). We may say that seeing reinforces looking, that 
hearing reinforces listening, and that touching reinforces reaching out (the several 


tesponses each occur in appropriate circumstances, but stimulus control is not 
critical to the present discussion ; cf. chapter 7). 
The interaction of sensor 


Source of controversy in the Psychology of learning. Theorists took sides in debates 
on whether 1 i 


are consequences of be 
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such as lights and sounds were received with skepticism. With repeated experiments, 
however, the phenomenon called sensory reinforcement became established (Kish, 
1966). For example, rats’ lever pressing in darkness increased if lever pressing 
briefly turned on a light. Although the effect on lever pressing was small and 
transient, the light had served temporarily as a weak reinforcer. Phenomena like 
these soon came to be discussed in terms of exploratory behavior and curiosity 
(e.g., Berlyne, 1960), and experiments were extended to a variety of sensory 
Consequences. For example, if a monkey is alone in an enclosed chamber, the 
Opportunity to look at other monkeys on the outside can be used to reinforce the 
Operation of a switch (Butler, 1957). 

In these experiments, the experimenter assesses the effects of sensory conse- 
quences on a response chosen for its ease of measurement. But in any environment 
the organism’s behavior will have inevitable sensory consequences. The organism 
Changes the part of the environment with which it comes in contact simply by 
moving from one place to another; the things it sees and touches change as it moves. 

An experiment by Held and Hein (1963) illustrates these relations between 
behavior and simple sensory consequences. Pairs of kittens were raised in darkness; 
their first experiences with visual stimulation were in the apparatus shown in Figure 
4.11. Both kittens received the same kind of visual stimulation: each wore a shield 
that prevented it from seeing its own feet and body, each was prevented from 
Seeing the other by the wide central post, and each saw the same pattern of black 
and white vertical stripes that uniformly covered the walls of their circular 
enclosure. The kittens were linked by a sort of miniature carousel, but one kitten 
Moved actively (A) whereas the other kitten was moved passively (2); The active 
kitten stood on the floor of the enclosure whereas the passive kitten stood inside a 
Carrier suspended a short distance above the floor. As the active kitten walked 
around the central post, the passive kitten in its carrier moved a corresponding 
distance on the other side, and if the active kitten turned around in place to walk in 
the other direction, the pulley system turned the passive kitten’s carrier so that it 
too turned to face in the new direction. Both kittens were exposed to the same 
Kinds of visual stimulation in the same temporal sequences, but the visual stimuli 
for the active kitten were consequences of its own behavior Whereas those for the 
Passive kitten were not; they depended on the active kitten’s movements rather 
than on its own. The two kittens were then given standard tests of visual-motor 
coordination such as visual paw-placement (normal kittens extend their paws 
toward a table edge or other horizontal surface if held in the air a short distance 
away from it). Even though their exposure to visual stimuli was equivalent, only 
the active kitten responded appropriately in these tests; the passive kitten later 
became able to respond appropriately, after it was allowed to walk about freely in 


a lighted room. দ্‌ ট Y 

This experiment by Held and Hein has much in common with a classic exper- 
iment by Stratton (1897), who for eight days wore prisms that inverted and reversed 
his visual fields. At first his world looked upside-down and backwards and his move- 
ments were not coordinated with his surroundings. For example, in walking he 
looked down to see where he was going, but because of the inverting prisms he 


found that he was looking at the ceiling instead of the floor. Similarly, he had 
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difficulty in pointing at or reaching for Objects, because things once seen below eye 
level were now seen above eye level and things on the right were now seen on the 
left, and vice versa. As time passed, however, coordination improved and Stratton 
teported that the world no longer even looked so upside-down. ; 
Stratton’s findings illustrate again the role of the consequences of behavior. 
Looking and moving in the visual field have different consequences with and 
without inverting prisms, and adjustment to the Prisms requires that these new 


consequences be learned. For example, seeing the floor as one walks is an important 
consequence, but looking down 


longer does so when one be 
floor is instead a conseque 
down can be defined eith 
Harris, 1965). If seeing th 


» Which produced that consequence in the past, no 
Eins wearing inverting prisms; with the prisms, seeing the 
nce of looking up (in these situations, of course, up and 
er relative to the visual field or relative to the body; cf. 
€ floor is a reinforcer while One is walking and one walks 
while wearing inverting pr 


isms, then Seeing the floor will reinforce the response of 
looking up instead of the response of looking down. 
With Held and Hi 


Consequences of Responding: Reinforcement 91 


Seeing a contour such as the edge of an object is a reinforcing consequence for eye 
movement. We should expect eye movements to become coordinated with events in 
the visual field. Available data from the eye movements of newborn infants are 
consistent with this view. For example, when shown a simple figure such as a 
triangle in a uniform visual field, infants tend to fixate more and more accurately 
On the edges and vertices of the triangle as time passes (e.8., Salapatek and Kessen, 
1966; cf. Girgus, 1976; Aslin, 1977). 

We have much to learn about 


and consequences can be. For example, 
Visual stimuli in an infant’s current area of fixation. If we chose stimuli known to 


be effective as reinforcers for eye movement, we could then present these stimuli 
as a consequence of various kinds of eye movement. Given a stimulus in the right 
portion of the visual field, we might let it disappear if the infant looked to the 
right, but move it to the region of fixation if the infant looked to the left (and vice 
versa for a stimulus in the left portion of the visual field). In other words, this 
arrangement would create a world in which the infant could fixate on an object 
only by looking away from it. If eye movements changed accordingly in this kind 
Of artificial environment, we might conclude that similar processes operate in our 
natural environment (cf. Schroeder and Holland, 1968). 

Our visual system is built so that, in the natural environment, particular 


responses will automatically have particular consequences. The relations between 
responses and these consequences are the basis for our acquisition of behavior. In 
ot often say that the organism 


situations in which an organism leams, we cann y 
has learned either stimuli or responses, but we usually can say that the organism 


has learned the consequences of its own behavior. Reinforcement does not explain 
this learning; we cannot appeal to it as a cause of learning. Instead, reinforcement is 
part of the description of what is learned even with respect to our simplest interac- 


tions with events in the world. 


how arbitrary such relations between responses 
suppose an optical system could project 


SUMMARY 
of the Law of Effect: puzzle boxes, mazes, runways, 


and operant chambers. Out of these apparatuses and methods, the p DUDE Sf 
reinforcement emerged as a descriptive term appropriate when responding increased 
because of its consequences. The reinforcement relation !s relative. A stimulus 
Providing an opportunity to engage in one response can reinforce a less probable 
response, and thus reinforcement Telations can be reversed by changing response 
Probabilities through such procedures as deprivation. The effects of reinforcement 
Are temporary, as demonstrated by the phenomena of extinction. The reinforce- 
ment of responses necessarily includes the presentation of stimuli, and extinction is 
therefore complicated because it involves both terminating a contingency and 
terminating stimulus presentations. The contributions of contingencies and stimulus 
Presentations to behavior can be analyzed by comparing reinforcement, extinction, 
and superstition procedures. Phenomena such as latent learning and sensory-motor 
learning demonstrate that reinforcement is not an explanation of learning; rather, it 
is part of the description of what is learned. Organisms learn contingencies, i.e., 


the consequences of their own behavior. 


We began with a brief history 


Consequences 
of Responding: 
Aversive Control 


JNISHMENT 
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Punishment and reinforcement have fairly straightforward histories. Punish- 
ment stems from the Latin poena, pain or penalty, and reinforcement from 
the Latin fortis, strong, which is related to such structural items as fort and 
burg. Escape and aversive are more interesting. Escape, as ex-, out of, plus 
cappa, cape, is derived from the Old North French escaper, to take off one’s 
cloak or, by extension, to free oneself from restraint. Aversive is derived from 
the Latin a, away, plus vertere, to turn. Vertere has a Germanic relative, the 
suffix -ward or -wards, to turn attention to, in the sense of guarding or regard- 
ing; this suffix appears in reward, an occasional synonym for reinforcement. 


on, reinforcement, in which the consequences of 
likely. There also exists a relation, punishment, 


in which the consequences of responding make responding less likely. Furthermore, 
a stimulus that reinforces when produced by a response may serve a different 
function when responding removes it: its removal may punish responding. Inversely, 
8 stimulus that punishes when produced by a response may reinforce responding by 
its removal. For example, money may be used to reinforce behavior, as when a 
Child is paid for the completion of a chore, but its removal may be used to punish 
behavior, as when the child’s allowance is canceled because of a misdeed. Similarly, 
a painful burn may punish behavior, 85 when one learns not to touch a pan just 
taken from a hot oven, but its removal or prevention may reinforce behavior, as 
When one learns to treat a burn with appropriate medication or to put on a kitchen 


glove while handling things around a stove. 
Except for reinforcement by presenting a stimulus (chapter 4), these relations 
are often grouped together as instances of aversive control. In other words, aversive 


control includes both punishment and reinforcement by the removal or prevention 
Of aversive stimuli. This chapter first treats punishment, and later turns to reinforce- 
ment by stimulus removal or prevention, in escape and avoidance procedures. 


So far we have emphasized a relati 
tesponding make responding more 


PUNISHMENT 


As an operation, punishment is arranging a consequence of responding that reduces 
the likelihood of the response. The stimulus arranged as a consequence is called a 
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Punisher. For example, if electric shock is delivered to a rat whenever the rat 
presses a lever, the lever press is said to be punished and the shock is said to be a 
Punisher, because this operation reduces lever pressing. In these respects, the 
vocabulary of punishment parallels the vocabulary of reinforcement (cf. Table 4-1). 

Like reinforcement, the term punishment has been applied to both operations 
and processes. Thus, in some literatures stating that a response was punished may 
mean either that the response produced a Punisher or that responding decreased 
because it produced a punisher. As with reinforcement, preferred usage will be to 
restrict punishment to the Vvotabulary of operations and to describe the process 


sponding. Because of varied usages in the literature, 


» Of punishment as Process, will occasionally be con- 
venient. 


The Vocabulary of reinfor 
Punishment by the Object of the 


Consider a child misbehavi. 
comes administers a spanking. T' 
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how a rat’s lever pressing might change when the rat’s lever pressing is followed 
first by reinforcement with food pellets and then by extinction. During baseline 
performance, when no consequence is arranged for lever pressing, responding occurs 
infrequently. When reinforcement begins, responding increases over the first few 
sessions, after which it remains at a fairly stable level. Extinction then reduces 
responding to a level that gradually approaches that during baseline. 

The bottom graph of Figure 5-1 shows how lever pressing might change 
during and after punishment of lever presses with electric shock. Because punish- 
ment reduces responding, some responding must exist to begin with or no decrease 
could be observed. In this example, substantial responding is already maintained 
by food reinforcement and reinforcement continues throughout all sessions; the 
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ment and punishment on the hypo- 
The top graph shows unreinforced 
resses produce food (reinforce- 


Figure 5.1. Comparing the effects of reforce. 
thetical lever pressing of a food-deprived rat. 


lever j i i when lever P 
pressing (baseline), the increase y Ce £ I 
ment), and te return 1 earlier levels when reinforcement is discontinued (extinc- 


tion). The bottom graph shows responding maintained by reinforcement (baseline), 
the decrease when response-produced electric shock is superimposed on this 
Performance (punishment), and the return to earlier high levels when punishment 
is discontinued (recovery). The bottom graph uses responding maintained 
throughout by reinforcement as the baseline against which to illustrate the effects 
Of punishment because a decrease in responding is not easily seen against a baseline 


In which responding is already infrequent. 
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effects of punishment can then be assessed by superimposing the punishment Opera- 
tion on this baseline. The baseline shows the responding maintained before lever 
pressing is punished. When punishment begins, lever pressing decreases during the 
first few sessions to a maintained low level. In recovery, punishment is discontinued 
and responding gradually returns to the level observed during baseline. 

The figure suggests considerable Symmetry of reinforcement and punishment: 
the former increases responding whereas the latter decreases it, but their effects 
continue while the procedures are maintained and disappear after the procedures 


end (responding returns to earlier levels). Actual rather than hypothetical effects 
of punishment are shown in Figure 5-2 (Estes, 1955). 
The effects in Figure 5-2 are 


Estes (1944) 
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“Tight” and punished by saying “wrong.” In the procedures that Thorndike used, 
Saying “wrong” had minimal effects, and the conclusion transmitted to subsequent 
investigators was not simply that saying “wrong” to a human learner was ineffective 
as a punisher but rather that punishment in general could not be demonstrated. 

Thorndike’s conclusion had sufficient impact that even the data in Figure 5-2 
Were interpreted to mean that punishment was an ineffective procedure. Based on 
the recovery of responding after punishment was discontinued, the argument was 
that punishment could suppress responding only temporarily and therefore was not 
to be taken seriously as a technique for changing behavior. Yet on the basis of this 
criterion, reinforcement also would have to be judged ineffective. For some reason, 
the criteria imposed for the effectiveness of punishment were different from those 
for the effectiveness of reinforcement. Even though responding was reduced during 
Punishment, investigators tended not to attend to that reduction and instead 
looked only at the recovered responding after punishment was discontinued. 

It is difficult to assess the role of the intellectual climate of those times. 
Psychologists were in a position to make pronouncements about the efficacy of 
reward and punishment that could influence child-rearing and educational practices 
across the nation, and ethical as well as scientific judgment undoubtedly 
contributed to the conclusions. What follows will Ee Boe eI 
in ] i is undesirable for eliminating bena 
le apo ated COTTE” 0 ly considered whenever available. But if 
this conclusion remains correct, it will be only because Thorndike and his 
Successor i e wrong reasons. s ক 

In fo RE ত Vs foatols turned again to studying punishment and 
demonstrated some conditions that modify its effectiveness In suppressing es 
(e.g., Azrin and Holz, 1966; Church, 1963; R. L. Solomon, 1964). For ne e, 
the following are some conclusions based on experiments with electric as ন 
Punisher of a pigeon’s food-reinforced key pecking: 2 ET fe 
Immediate the punisher, the more effective it will be;a punis ন Cr Ee 
Maximum intensity will suppress responding more effective Y ন or য 
introduced at low intensity and gradually increased to maximum inte! ty; 

ded periods of punishment, as 


effectiveness of the punisher may change over exten I | 
When a punisher or low intensity gradually becomes ineffective after many 


Presentati i d Holz, 1966, PP. 426-27). \ 
These Te only changed the criteria for the effectiveness of 


L Ww i judged adequately 
u i hether unishment had been judg 
ishment, but also examined e p Ss dged জক 


2ccordin riteria. On occasion a sin er or 
scene Hea al if a consequence of a weakly maintained EE ets 
Make a response disappear for days or months or perhaps even the res ডং e 
Organism’s life; thus, it was argued that punishment could have permanent effects 
after all. Such Gases are exceptional, and we need not be concerned that recovery 

to all intents and purposes unobservable. 


may sometimes be so slow that it is 
Consider the analogous argument for reinforcement. If someone bought a lottery 
ticket that won a ten-thousand-dollar prize, we would not be surprised to learn 
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that the occasional buying of lottery tickets continued throughout the person’s 
lifetime even though it was never again reinforced. We would not reject the 
existence of extinction on those grounds; we therefore should not reject recovery 
because of occasional instances of durable punishment effects. 

Some incidental features of punishment probably contributed to its unusual 
treatment in the history of the psychology of learning. We noted that a reduction 
in responding can be studied only if some responding already exists. A response 
that is never emitted cannot be punished. Thus, experiments on punishment often 
Proceed by superimposing it on responding maintained by reinforcement. But the 
effects of punishment may then depend on what maintains the responses. For 
example, if lever presses are punished with electric shock, food-reinforced lever 
pressing will probably decrease less if a rat is severely food-deprived than if it is 
only mildly food-deprived. 

Another difficulty is that punishing stimuli are likely to have additional 
effects that occur whether or not the sti 
reinforcement, the effect of 


punishers. For example, ele 
Pecks a key even if shocks 
to demonstrate that the s 
than when they occur inde 

The prejudices aga 
Procedures were someti 


The relativity of punishment 


In experiments on punishment, punishers are usually chosen for their reliable 
effect on a variety of responses, because such stimuli reveal the effects of punish- 
ment most clearly. One such punisher is electric shock, which can be measured 
accurately and can be presented at levels that are effective and yet do not damage 
tissue. Such stimuli, however, are only extreme instances of punishers, and even 
stimuli that ordinarily serve as reinforcers can become punishers under appropriate 
conditions. For example, drugs that at first have reinforcing consequences may later 
become aversive (as when a dose of LSD produces a bad trip). Like reinforcers, 
punishers cannot be defined in absolute terms nor can they be specified in terms of 
any common physical properties. Rather, punishers must be assessed in terms of 
the relation between the punished response and the responses produced by the 


Punisher. 

The Premack principle of reinforcement (chapter 4) stated that an oppor- 
tunity to engage in a more probable response will reinforce a less probable response. 
The analysis has also been extended to punishment (Premack, 1971). Let us return 
to the apparatus that can control a rat's opportunities to run in a running wheel or 
to drink from a drinking tube. It has been modified by a motor that can either lock 
the running wheel in position, thereby preventing the rat from running, or rotate it 
at a fixed speed, thereby forcing the rat to run. Rotating the wheel can then be 
made a consequence of some response, Such as drinking from the drinking tube or 


Pressing a lever. 

In this apparatus, depriving the rat of an opportunity to Tun while giving it 
free access to water makes running more probable than drinking, and depriving 
the rat of water while giving it an opportunity to run makes drinking more probable 
than running. Rotation of the wheel can then be made a consequence of drinking: 
each time the rat drinks, the wheel begins to turn and the rat is forced to run. When 
running is more probable than drinking, this operation increases the probability of 
drinking, and it is appropriate to Sty that drinking is reinforced by running. But 


When running is less probable than drinking, this operation has an opposite effect: 
now the probability of drinking decreases s its consequence, and it 


when running i 
is appropriate to say that drinking is punis This example illustrates 


hed by running. 
either reinforcement or punishment of a single response (drinking) by a single 
Consequence (enforced running), depending on the relative probabilities of these 
Tesponses. The stimuli and responses in typical experiments on reinforcement and 
Punishment had been chosen to make those procedures work (e.g., with food- 
deprived rats, eating is far more probable than lever pressing), and thereby obscured 
the potential reversibility of consequences 4s reinforcers and punishers. Responding 
Can be raised or lowered by changing its consequences, and these effects are deter- 


Mined by the behavioral and not the physical properties of the consequences. 


Side effects of punishment 


As with reinforcers punishers can have effects independent of their 
ৰ anism is shocked or burned or pinched, 


Contingent relation to responses. If an Org 
99 
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some of its responses may have little to do with Whether or not these events were 
brought on by the organism’s own behavior. Difficulties arise in analyzing punish- 
ment because such effects must be distinguished from those that depend on the 
relation between responses and their consequences. Some effects of shock may be 
Primarily physiological, as when Successive shocks Systematically reduce a rat’s 
skin resistance (Campbell and Teghtsoonian, 1958). Depending on the nature of 
the shock source, the effectiveness of later shocks may then vary with the rat’s 
resistance. Other effects are primarily behavioral, as when the apparatus fails to 
prevent responses by which the Organism can reduce its contact with the shock 
Source (e.g., fur is an insulator, and rats have Sometimes minimized effects of shock 
4S a punisher by pressing the lever while lying on their furry backs; Azrin and 
Holz, 1966). In either Case such side effects must be taken into account. 

Eliciting effects of punishers. Figure 5 
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events (Rachlin, 1967, p. 87). Just as we must distinguish between the effects of 
reinforcer deliveries and those of the contingent relation between responses and 
reinforcers, so also must we distinguish between the effects of punisher deliveries 
and those of the contingent relation between responses and punishers. 

These considerations are illustrated by the punishment of two classes of 
species-specific behavior in the Mongolian gerbil (Walters and Glazer, 1971). Sand- 
digging, part of the gerbil’s nesting behavior, consisted of scooping sand and kicking 
it backwards; alert posturing, a defense reaction occasioned by sudden or aversive 
Stimuli, consisted of standing erect on the hindpaws with ears cocked. Both 
responses occurred frequently in the sand box where the experiment was 
Conducted. Because delivering shock was difficult in the sand box, a tone was 
established as an aversive stimulus by repeatedly pairing it with shock in a different 
setting. When this stimulus was contingent upon digging, it was an effective 
Punisher: digging decreased when it produced the tone and recovered after the 
contingency was discontinued. The decrease in digging was accompanied by an 
Increase in alert posturing. But when the tone was contingent upon alert posturing, 
alert posturing increased when it produced the tone and did not return to earlier 
levels for several sessions after the contingency Was discontinued. The increase in 
alert posturing was not accompanied by any appreciable change in digging. The 
tone presumably functioned as a punisher for digging but not for alert posturing 
because the eliciting effect of the tone on alert posturing Was more powerful than 
its punishing effect. Thus, in punishment as in reinforcement it Is important to 
acknowledge the separate effects of response-stimulus contingencies and stimulus 


deliveries. 

Let us turn to another example (Kelleher and Morse, 1968). Consider a 
monkey in a restraining chair. Shock electrodes are placed on the monkey s tail, 
and the monkey sits before a lever. At the end of successive ten-minute intervals, 
a lever press will deliver shock to the monkey's tail. A few minutes after the 
Monkey has been placed in the chair, it begins to press the lever. Eventually ten 
minutes elapse and the next press delivers a shock. The monkey briefly jumps and 
Squeals, and for a while does not press the lever. But it does begin pressing again, 
tesponding more and more rapidly until it shocks itself at the end of the next ten- 
minute interval. This performance is repeated over and over throughout extended 
daily sessions. On days when shock is discontinued, the monkey's lever pressing 
Virtually ceases; when shock is reinstated, lever pressing returns. If the shock level 
1S raised, lever pressing increases; if the shock level is lowered, it decreases. Shocks 
depend completely on the monkey’s behavior; the monkey would not receive 


Shocks if it did not press the lever. Why is it then that the monkey does not simply 


Stop pressing? eal te 
Behavior like this can be generated in many ways and is easily maintained 
(e.g., Morse and Kelleher, 1977, PP- 193-98). The paradox is that the same shock 
: duced by the first lever press after the end of 


that maintains responding when pro Ie t 
Successive ten-minute intervals will suppress responding if produced instead by 
every lever press. How do we decide whether the language of punishment is appro- 


Priate? If every lever press produces shock, we can call the shock a punisher because 
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this consequence reduces responding. But if lever presses produce shock only at 
the end of successive ten-minute intervals, we should call the shock a reinforcer 
because this consequence generates responding. (It would not help to yield to the 
temptation to call the monkey a masochist. Masochism is simply a name we use 
when a stimulus that we believe should be a Punisher serves as a reinforcer. The 
term does not provide an explanation.) 

One possibility is that the appropriate comparison should be between 
response-produced shock and tesponse-independent shock rather than between 
response-produced shock and no shock at all. We know that electric shock elicits 
such manual responses as lever pressing in monkeys (chapter 3). If response- 
independent shock produces more 


Shock, we may conclude that the eliciting effects of shock are so powerful that they 


studies would be unjustified 
SO easily and superficially. 
these performances, these e 
and Kelleher, 1977). We do 
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When shock was discontinued. (From Holz and Azrin, 1961, Figure 3.) 
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Thus, a behavioral analysis may be relevant to such problems as child abuse. 
Behavior is determined in multiple ways, and the point of a behavioral analysis is 
to break complex situations down into their simpler components. 
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Summary 

ment. Reinforcement is defined by 
nishment is defined by decreases. 
f reinforcement: a punisher is a 


র্‌ Punishment is the inverse of reinforce 
increases in consequential responding whereas pu 


The vocabulary of punishment parallels that ©. ; 
stimulus and punishment is an operation or 4 process. The effects of punishment 


are ordinarily temporary; responding usually recovers to earlier levels after punish- 
ment is discontinued. In studying punishment, baseline levels of responding must be 
high enough to make the reduction in responding easily visible. Thus, experiments 
often superimpose punishment on 8 baseline of reinforced responding. An analysis 


of punishment in terms of baseline response probabilities shows that the effective- 
ness of punishers, like that of reinforcers, is determined by the relation between 
the punished response and the resp d by the punisher. In 


onses occasione 
Punishment, a more probable response 


forces the organism to engage in a less 
Probable response. The reversibility of reinforcement and punishment implied by 
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this relativity can be demonstrated experimentally. Punishment effects can be 
complicated by eliciting or discriminative effects of punishers. The task of an 
experimental analysis is to separate such side effects from the primary effect of 
the punisher. These effects occur because punishment necessarily includes both 
stimulus presentations and a contingency between responses and stimuli; the effects 
of the stimulus presentations must be distinguished from those of the contingency. 


REINFORCEMENT BY STIMULUS 
REMOVAL (NEGATIVE 
REINFORCEMENT) 


Organisms may terminate as well as produce stimuli. For example, a rat does not 
Ordinarily expose itself to shock, and if shock does Occur the rat will get away from 
it if given the opportunity. Just as presenting an aversive stimulus can be the basis 
for punishment, terminating or preventing it can be the basis for reinforcement. 
When a response terminates or prevents an aversive stimulus and the response 
thereby becomes more probable, the stimulus is called a negative reinforcer and the 
Operation is called negative reinforcement. Positive and negative reinforcement are 
therefore distinguished by whether a response produces or removes a stimulus. 

Later we will encounter some problems in the vocabulary of positive and 
negative reinforcement. Here, we mainly acknowledge a vocabulary with substantial 
Precedent. It is now generally agreed that positive and negative, as modifiers of the 
term reinforcement, should refer to the nature of the consequence produced by 
responding (whether the response adds something to the environment or takes 
something away), and that negative reinforcer should refer to the stimulus itself 
and not to its removal (if removal of shock reinforces a Tat’s lever press, shock is 
the negative reinforcer and not the shock-free period that follows the response). 
This vocabulary was established gradually, however (for some time, negative 
reinforcement was defined as above in Some textbooks but as equivalent to punish- 
ment in others). As indicated by the etymologies introducing each chapter, 
language evolves. Thus, the evolving language of reinforcement may eventually 
make the distinction between positive and negative reinforcement of marginal 
value (cf. Michael, 1975). Nevertheless, when we do invoke the vocabulary of 


positive and negative reinforcement and Punishment, we will adhere to the 
dominant contemporary usage, summarized as follows: 


A. Reinforcement make 
B. Punishment makes t 
C. The modifier 
D 
E 


and not the absence of stimuli (given 


to make an unambiguous distinction). 


The last item is Parenthetically qualified because the distinction is sometimes 
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difficult. For example, is it more appropriate to think of the effects of a traffic 
ticket in terms of presentation of the paper on which the ticket is written or 
removal of money from one’s funds on payment of the fine? Similarly, is it more 
appropriate to think of the effects of performing well on an examination in terms 
Of presentation of a high grade, removal of social pressures exerted by parents or 
teachers or peers, or prevention of lower grades that might have been but were not 


given? 


Escape 


e simplest examples of negative-reinforcement 


Operations: an organism’s response terminates an aversive stimulus. Escape proce- 
dures are contrasted with avoidance procedures, in which the response prevents or 
delays an aversive stimulus. This vocabulary is consistent with everyday usage: we 
escape from aversive circumstances that already exist, but we avoid potential 


aversive circumstances that have not yet occurred. 

Consider a rat in a compartment with an electrified grid floor. The conditions 
for escape may be arranged simply by constructing the compartment so that the 
rat can leave it. Movement from one place to another as the escape response is 
illustrated in Figure 5-5, from a shock-escape experiment with rats in an electrified 


runway (Fowler and Trapold, 1962). Running speed was fastest when shock was 
hed the end of the runway. The longer the delay 


terminated as soon as the rat reac IRL 
between reaching the end of the runway and terminating shock, the slower the 
running. This is one of many examples of quantitative effects of reinforcement. 

inforcement, immediate reinforce- 


For exam i itive and negative re 
ple, with both positive E H 
ment is more effective than delayed reinforcement (delay parameter ) and large 


reinforcers are more effective than small reinforcers (magnitude or intensity para- 
meter). 
Movement from one place to anot 


Escape procedures are th 


her is an escape response that is usually 


easy to establish, but more discrete responses such as a rat’s lever presses are often 
Preferred. In the presence of electric Shock a rat's press may tum off the shock, 
Or in the presence of a bright light the press may turn off the light (e.g., F. S. 
Keller, 1941). If in either case responding increases because of its consequences, we 
may say the response has been reinforced. Let us therefore compare positive and 
negative reinforcement: in the absence of food, responding that produces food 
increases; in the presence of shock, responding that removes increases. The 
Parallel is straightforward. Yet in research on negative reinforcement, escape 
Procedures have not received as much attention as more complex procedures with 
aversive stimuli. Research on negative reinforcement is dominated by avoidance, 
in which aversive stimuli are prevented or delayed by responses that occur in their 


absence. 
2 i i i titative relation that determines the effect of other 
A is a variable in a quan! “th ct 
Variables Eee fests of reinforcement may depend jointly on oY and magnitude of 
Teinforcement, and we would therefore say that both delay and magnitude are parameters of 
reinforcement); for most purposes in the analysis of behavior, parameter may be read as 
Synonymous with variable. 
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Figure 5-6. Schematic representation of positive reinforcement (top) and negative 
reinforcement (bottom), showing the different temporal relations between the 


reinforced response and other responses produced by the reinforcer. In food rein- 
forcement, responses occasioned by food (chewing, etn) te the I 
lever press fore less likely to compete with lever pressing an in 

press, and are there hock (Gumping, etc.) precede the rein- 


Shock escape, when responses occasioned by s 
forced lever press. 

ikely to decrease with termination of the shock that 
generated responses no longer compete 
f shock the lever press cannot be 


these other responses are l 
generated them. At this point, the shock 
with the lever press, but in the absence 0 
reinforced by shock termination. 


Similarly, if the negative reinforcer is a bright light from which the rat may 


escape by pressing a lever, the rat may reduce the effects of the light by closing its 
eyes and hiding its head under its paws in a corner of the chamber. Any movement 
from that position will be punished by greater exposure to the light, and thus the 
rat will not be likely to come out of the corner and press the lever. But rats can 
learn to escape from light by lever pressing, especially if the escape response is 50 
quickly learned that other competing responses do not become well established 
(F. S. Keller, 194). টা ls $ 

The ambiguity of distinguishing between positive and negative reinforce- 


ment. In distinguishing between positive and negative reinforcement, an important 
criterion is whether responses generated by the reinforcer occur at times when they 
can interfere with reinforced responses. Thus, a distinction based merely on whether 
stimuli are presented or removed may be of limited utility. An experiment on 
escape from cold illustrates the point (Weiss and Laties, 1961). A rat was placed in 
a cold chamber in which its lever presses were reinforced by the operation of a heat 


lamp. On the one hand, this procedure may be regarded as positive reinforcement, 
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because it involves adding energy to the environment in the form of heat when a 
lever press occurs. On the other hand, relative to a neutral temperature, cold may 
function as a stimulus through its effects on the temperature receptors in the rat’s 
skin. Although cold is the absence of heat, it is certainly a significant event in the 
Organism’s environment and may be regarded as aversive. By this interpretation the 
procedure should be called negative reinforcement, because turning on the heat 
lamp when a response occurs terminates the effects of cold. 

In escape from cold, it is difficult to specify whether reinforcement involves 
presenting or removing a stimulus, and it is easy to find ambiguities in other 
familiar instances of reinforcement. For example, we could argue that water as a 
reinforcer terminates aversive stimuli generated by a dry mouth, or that food as a 
reinforcer terminates aversive stimuli generated by the depletion of nutrients in the 
blood stream (cf. Hull, 1943). Why then did the distinction between positive and 
negative reinforcement ever take on behavioral significance? Let us reconsider that 


rat in the cold. Before the reinforced lever press occurs, the rat may huddle in a 
corner and shiver. These res 


the lever. Once the lever 


not escape from cold by 
€ reinforced response and 
ment, this example seems 
Od or water (cf. Figure 5-6). 


response. 


Avoidance 
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can postpone the violation signal indefinitely. If a shock is delivered at the end of 
&n RS-interval, because no response occurred, control switches back to the SS- 
interval clock. With this procedure, called Sidman avoidance or continuous avoid- 
ance, avoidance responding can be studied independently of escape responding; 
shock can be prevented by avoidance responses, but once delivered it is so brief 
that there is little if any opportunity for its termination. 

Data are illustrated for one rat’s lever pressing in Figure 5-7, which shows 
rate of responding as a function of RS interval with SS interval as a parameter. 
Across functions, the RS interval that produced the maximum rate of lever pressing 
depended on the SS interval. First consider an avoidance schedule with an RS 
interval of five seconds and an SS interval of two seconds. Any responding at all 
will reduce shock, and a rate of one response in every four seconds will avoid shock 
completely. When the RS interval is shorter than the SS interval, however, some 
patterns of lever pressing can increase rather than decrease shock. For example, 
consider now an avoidance schedule with an RS interval of two seconds and an SS 


interval of five seconds. A rat that never lever presses will receive a shock every five 
the rat presses every three seconds, a 


seconds, or twelve shocks per minute. But if 

shock will be delivered two seconds after each lever press and the rat will therefore 

receive twenty shocks per minute. Strictly, if this increase in shock rate reduced 
te to say that responding was punished (ctf. 


responding it would be appropria 
Sidman, 1953, 1954). Only if the rat’s responding is fast enough that two seconds 
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never elapse without a lever press will the rat completely avoid shock, but the 
increase in shock produced by lower rate responding may prevent it from ever 
attaining such a performance. 


Species-specific defense reactions. The advantage of avoidance procedures 
over escape procedures is that opportunities for the reinforced response occur in 
the absence of the aversive stimulus. Thus, other responses elicited or occasioned by 
the aversive stimulus need not continuously compete with avoidance responding. 
Nevertheless, just as the ease with which different responses can be established as 
escape responses varies with different organisms, so also does the ease with which 
different responses can be established as avoidance responses. For example, pigeons 
avoid shock more readily if they can do so by moving from one side of a chamber 
to another than if they can do so by pecking (e.g., Macphail, 1968). The demonstra- 
tion of avoidance responding makes it more difficult to make the case that such 
differences depend on competition between the reinforced response and other 
responses occasioned by the aversive stimulus. It has been argued instead that the 
differences arise because organisms are variously equipped with defense responses 
that are species-specific and that success with avoidance procedures will depend 
upon whether the experimenter chooses a response that the organism is already 
prepared to emit in aversive situations (Bolles, 1970; Seligman, 1970). 

Bolles summarizes the argument as follows: 


What keeps animals alive in the wild is that they have very effective innate 
defensive reactions which occur when they encounter any kind of new or 
sudden stimulus. These defensive reactions vary somewhat from species to 
species, but they generally take one of three forms: Animals generally run or 
fly away, freeze, or adopt some type of threat, that is, pseudo-aggressive 
behavior. These defensive reactions are elicited by the appearance of the 
predator and by the sudden appearance of innocuous objects. These responses 
are always near threshold so that the animal will take flight, freeze, or 
threaten whenever any novel stimulus event occurs. It is not necessary that 
the stimulus event be paired with shock, or pain, or some other uncondi- 
tioned stimulus. The mouse does not scamper away from the owl because it 
has learned to escape the painful claws of the enemy; it scampers away from 
anything happening in its environment, and it does so merely because it isa 
mouse. The gazelle does not flee from an approaching lion because it has been 
bitten by lions; it runs away from any large object that approaches it, and it 
does so because this is one of its species-specific defense reactions. Neither 
the mouse nor the gazelle can afford to learn to avoid; survival is too urgent, 
the opportunity to learn is too limited, and the parameters of the situation 
make the necessary learning impossible. (Bolles, 1970, p. 33) 


Two-process theories of avoidance. Avoidance, as the quotation suggests, has 
often been a focus of controversy. Bolles’ point concerns the extent to which 
avoidance behavior is learned. Another issue is specifying what reinforces avoidance 
responding. When a successful avoidance response occurs, the important 
consequence is that nothing happens to the Organism. How can the absence of an 
event affect behavior? According to some views, avoidance responding is 
maintained because the organism is escaping from some properties of the situation 
that accompanied past aversive stimuli. This view evolved from earlier procedures in 
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which a warning stimulus preceded shock and the organism could prevent the shock 
by emitting the avoidance response in the presence of the warning stimulus. 
Avoidance was most easily established when the avoidance response not only 
prevented the shock but also terminated the warning stimulus. It was assumed that 
the warning stimulus acquired aversive properties of its own through its consistent 
relation to the aversive stimulus, and therefore that this immediate consequence, 
escape from the warning stimulus, was the effective reinforcer. In fact, one 
purpose of Sidman’s avoidance schedule was to demonstrate avoidance responding 


without a warning stimulus. 


This view, that the warning stimulus acquired aversive properties from its 


consistent relation to the aversive stimulus and that the termination of the warning 
stimulus therefore reinforced the avoidance response, was called a two-process 
theory of avoidance (Mowrer and Lamoreaux, 1946; Kamin, 1956, 1957; see also 
Hoffman and Fleshler, 1962). The main issue was the nature of the reinforcer. 
Some theorists were willing to accept the fact of avoidance responding without 
appealing to some reinforcing event occurring at the moment of the avoidance 
response. Others felt that it was necessary to specify such an event. When, as in 
Sidman avoidance, the event could not be located in some warning stimulus in the 
environment that terminated with an avoidance response, these theorists instead 
located the event inside the organism. It was argued, for example, that the 
Organism’s state just after an avoidance response, when shock was not imminent, 
Was reinforcing relative to its state long after a response, when a shock might occur 
at any moment. The latter state acquired aversive properties through its close 
relation to shock and like a warning stimulus was terminated bya response. Because 
this change of state presumably occurred immediately with each avoidance 
response, it was assumed that an account based on other events at some time 


distant from the response was unnecessary. | } 
Debates on the status of such theories of avoidance responding have a com- 

plex history (e.g., Anger, 1963; Herrnstein and Hineline, 1966; Herrnstein, 1969; 
Hineline, 1977), with some even changing sides (e.8., Schoenfeld, 1950, 1969). 
Sometimes the issues were reduced to experimental questions: For example, if 
ust the organism be able to reduce the 


avoidance responding is to be maintained, m bs 
PONTE for the organism to be able to 


total number of shocks in a session, Or is it sufficient J 
delay individual shocks even though the same number of shocks is eventually 


delivered? It turns out that either condition will maintain avoidance responding 
(Hineline, 1970; E. T. Gardner and Lewis, 1976). 

Another issue was whether avoidance responding could be maintained when 
Tesponding reduced the likelihood of shocks but did not reliably postpone each 
shock as in Sidman avoidance. Herrnstein and Hineline (1966) arranged an 
avoidance schedule in which shock was delivered to a rat with a specified 
probability at the end of every two seconds. Shock occurred with one probability 
if no lever press occurred and with a different (usually lower) probability if a lever 
press did occur. For example, in one condition lever presses reduced the probability 
of shock from 0.3 to 0.1. Avoidance lever pressing was generated by this procedure, 
but response rates were sufficiently low to allow varied interpretations (e.g., that 
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responding occurred because each shock simply occasioned or elicited one or two 
lever presses). More important, even though no consistent temporal relation existed 
between individual responses and individual shocks, the likelihood of shock 
increased more rapidly on the average as time elapsed without lever pressing than as 
time elapsed after a lever press.. Thus, it was possible to argue that the reduction in 
average aversiveness produced by a response was a sufficient immediate 
consequence to reinforce avoidance responding. And so it went. 

These issues grew out of the difference between molecular and molar orienta- 
tions toward the analysis of behavior. The first approach deals with behavior in 
terms of moment-to-noment sequences of events in a given setting; the second 
deals with properties that can be measured only over extended time periods. For 
example, a molecular approach to avoidance examines the individual time intervals 
that separate individual responses and individual shocks whereas a molar view 
examines the more general relation between rate of responding and rate of shock 
Over a large sample of responses and shocks (note that rate can be determined only 
by sampling events over an extended period of time). 

These issues have not been and perhaps cannot be resolved. Either molecular 
Or molar properties of behavior may be important in different settings. Just as 
Organisms may come into situations with species-specific responses likely to be 
engendered by certain events, it is also reasonable to assume that evolution has 
equipped them with the Capacity to respond differentially to a variety of properties 
of the situations in which they find themselves. Thus, there seems no a priori 
Justification for assuming that an organism whose responding is determined by the 
molecular properties of one situation (e.g., the consistent temporal relations 
between Tesponding and shocks established by the RS and SS intervals of Sidman 
avoidance) would be incapable of responding according to molar properties of 
another situation (e.g., the consistent overall relations between rate of responding 
and rate of shock established by a probabilistic avoidance schedule). If this is so, 


it is not a matter of choosing one or the other approach but rather of deciding 
which approach is more appropriate to the analysis of any given situation. 
Extinction 
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between the response and the feeder. Both methods have the same effect: food is 
no longer delivered. That is not so with shock escape or shock avoidance. If the 
Shock source continues to operate but responses no longer remove or prevent 
shocks, shocks continue to be delivered. In Sidman avoidance, for example, all 
shocks would be controlled by the clock for the SS interval; responses would no 
longer operate the clock for the RS interval. This extinction procedure would dis- 
continue the contingency between responses and shock, but if responding had 
maintained a low shock frequency it would also substantially increase shock deliver- 
ies. The effects of changing the rate of shock would have to be separated from 


those of changing the contingency. 
The first method, discontinuing 
mon extinction procedure in avoida 


the aversive stimulus, has been the more com- 
nce. Because the absence of the aversive 


stimulus cannot be a reinforcing consequence unless the aversive stimulus is 
Sometimes presented, however, the second procedure, presenting the aversive 
stimulus while discontinuing the consequences of responding, is sometimes regarded 
as more appropriate. In any case, extinction after negative reinforcement shows 
that the effects of negative reinforcement are temporary, but the time course of 
extinction depends on the nature of the extinction operation and the rate at which 
aversive stimuli occur before and after the onset of extinction (cf. Hineline, 1977, 


PP. 377-81). 


Positive and negative punishment 


The distinction between positive and ne 
extended to positive and negative punishment 
are possible). Responses may be punished by some 


rotation of a wheel that forces a rat to run. Respo 
terminating events. For example, the removal of food as a consequence of a food- 


deprived rat’s lever pressing is likely to reduce lever pressing and therefore is an 
example of negative punishment. The problem is that punishment might be difficult 
to demonstrate in this instance. If the rat is food-deprived and food is available, the 
rat will probably eat rather than press the lever and thus we will have few oppor- 
tunities to punish lever pressing by removing food (consider Figure 5-6 with food 
and shock reversed). For this reason, studies of negative punishment have usually 
removed a stimulus in the presence of which responses are reinforced rather than 
temoving the positive reinforcer itself. (This parallels the emphasis on avoidance 


rather than escape in studies of negative reinforcement.) 
For example, assume that two levers are available to a monkey and that 


Presses on one lever produce food whenever 8 light is on. We can expect some 
Pressing on the other lever, but this pressing can be punished by making each press 
Produce two minutes during which the light is turned off and presses on the first 


lever no longer produce food. Such periods are called time-out, and the operation 
ime-out from positive reinforcement (e.s., 


is sometimes called punishment by j 
Ferster, 1958; Kaufman and Baron, 1968). Negative punishment has also been 
called onion training, but this name is typically restricted to cases in which the 
Operation is signalled (e.8., Sheffield, 1965). 


gative reinforcement can easily be 
(though here too ambiguous cases 
events, such as shock or the 
nses also may be punished by 


SUMMARY AND EXTENSION: 
THE LANGUAGE OF AVERSIVE 
CONTROL 


The presentation or removal of stimuli can reinforce or punish behavior. 5) 
empirical findings with reinforcement and punishment Suggest that reinforcemen 
is most effective if the reinforced response is compatible with or at least 
independent of the responding occasioned by the reinforcer, but that punishment 
is most effective if the punished response is incompatible with or is interfered with 
by the responding occasioned by the punisher. Thus, it may be easy to reinforce 
jumping or running with shock removal (escape), but it may be difficult to punish 
jumping or running with shock presentation. y Y 

The stimuli that can reinforce by their presentation can also punish by their 
removal, and vice versa. We have therefore spoken of punishers, of negative reinfor- 
cers, and of aversive stimuli. Each was introduced in a different context, but this 
Was fitting because context determines the behavioral function of any stimulus. 
Thus, aversive stimuli were introduced in connection with elicitation phenomena, 
Punishers entered with the discussion of consequences that reduced responding, and 
negative reinforcers joined these other terms as one class of consequences that 
raised response probability. It would be convenient if we could assume that each 
term identified different aspects of a single category of environmental events. We 
might then speak of shock interchangeably as an aversive stimulus, a punisher, or a 
negative reinforcer, depending on the current situation. 

For many stimuli much of 
If we know a stimulus is effective a 
be effective as a negative reinforce. 
calling it aversive. Consistencies are 
Origins in relations among the prob 
hierarchy. But we must beware c 
The very fact that we may easily r 
may not so effectively punish it w. 


by not responding, the organism 
punisher for responding. But the. 
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seri jety i indi 
En concern. For example, our society is not indifferent to whether a parent 
inforces cooperative behavior with praise or punishes the failure to cooperate 


with beatings. 


We can speak more easily in terms of discrete responses than in terms of the 


absence of responses, and therefore we prefer direct descriptions in terms of 
measurable responses such as lever presses Of key pecks whenever such descriptions 
are possible. An organism may exhibit more or less behavior at any given time, and 
even though we may never observe a complete absence of behavior as long as an 
Organism lives, we need not assume that all failures to act are in themselves actions. 
Just as we do not have to achieve a temperature of absolute zero to recognize 
temperature as a dimension of matter, So We do not have to produce a completely 
nonbehaving live organism to recognize that the quantity of an organism’s behavior 
can change from time to time. The implication is that if we punish a response, we 
should know what happens to that response before we go looking for an account 
in terms of other responses, and we should know what else the organism is doing 


before we begin speaking of not-responding as behavior. 


The ethics of aversive control 

The behavioral properties of aversive control would lead us often to agree 
with those who argue against aversive control on ethical grounds. For example, 
it is important to point out that a parent who arranges aversive contingencies for 
a child’s behavior may acquire aversive properties; to the extent that the child then 
learns to escape from or avoid the parent’s company, contingencies other than 
those available to the parent are likely to begin to influence the child’s behavior. 
But there are possible exceptions: if punishment seemed the only technique 
available to reduce the dangerous self-mutilating behavior of an autistic child, 
Punishment might be a lesser evil than the permanent damage the child might self- 
inflict. To the extent that ethical precepts are concerned with acceptable and 
Unacceptable outcomes of our actions, examining the consequences of responding 


Cannot help but be important. 
© Skinner outlines our present 
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tat i i ishment to alterna- 
Civil made some progress in turning from punis 1! na 
tive ERE Fa ing gods and hell-fire have given Way to an 
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asri i fair wages are recognized as an improvement over 
griculture and indus™ for the reinforcements naturally 
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port of the behavior 
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i more positive Sup 
parish ies been SapPlenereresot the toverning Sten) DL ren 
a long way from explo. i and we are not likely to make 
any real advance so long formation about punishment and the alter- 
natives to punishmen t the level of casual observation. As a 
consistent picture of plex CONT of punishment 
emerges from analytical research, 3 Fe COU 102 and skill 
needed to design alternative procedures. (Skinner, » PP. 192-93) 
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The Structure 
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SIX 


nses. Operant can be traced to the Latin opus, 
f operation and copy. Class can be traced to the 
Latin classis, a division of Roman citizens eligible for military draft and 
perhaps thereby a summons or call. Class is related to claim and calendar 
through the Indo-European root kel, to shout. It is tempting to assume a 
relation to the similar root, gal-, to call, which gives us call and recall, but the 


evidence does not support that assumption. 


An operant is a class of respo 
Work, which is also a source 0 


g stimuli (elicitation) and by 


ding (reinforcement and punishment). One 
ations is in terms of the relative positions 


We have seen how to change behavior by presentin. 


arranging consequences for respon 


Way to discuss the effects of these oper I EBUVED 
Of responses in the behavioral hierarchy. The ordering of responses in this hierarchy 


is a property of behavior with which the organism comes ES? the organism’s 
behavior consists of a repertory of responses, each with a different probability. 
But if we restrict our attention only to these responses, We 10S of the most 
interesting features of those changes in behavior called learning, i.e., the circum- 
stances in which an organism comes to respond in new Ways. In learning, an 
Organism may emit responses it was unable to emit before learning began. We must 
therefore examine how new responses may be added to an organism behavioral 
tepertory. In this chapter, we begin by considering shaping, a procedure for 
Senerating new responses. This will lead us to consider how classes of responses are 
defined as units of behavior, in the concept of the operant. Ti SeCOUCEPLBTOVINES 


the basis for discussing the structure of behavior. 
The rat’s lever press and the pigeon’s key peck lave offen Servet as our 


examples of responses. But if we simply place a rat in front of a lever Or a pigeon in 
front of a key, we may or may not observe a lever press Or a key peck. With some 
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organisms, we will be fortunate and these responses will occur after not too long a 
wait; with others we may have to wait one or several hours; and an occasional 
Organism may remain so long without responding that our patience is exceeded. 
Reinforcement cannot have any effect if the response to be reinforced is never 
emitted. An alternative is available, however. Instead of waiting for the response, 
the experimenter may generate a response by successively reinforcing other 
responses that more and more closely approximate it. 


SHAPING: THE DIFFERENTIAL 
REINFORCEMENT OF SUCCESSIVE 
APPROXIMATIONS 


Consider the pigeon’s key peck. Once the Pigeon begins to eat whenever the feeder 
is operated, the experimenter Operates the feeder only when the pigeon turns 
toward the key. After reinforcing two or three movements toward the key, the 
experimenter may then reinforce not Just any movement toward the key, but only 
movements that include forward motion of the pigeon’s beak. By this time, the 
Pigeon spends most of its time in front of the key, and the experimenter can shift 
attention from the Pigeon’s turning toward the key to the pigeon’s forward beak 
movements. The forward beak movements 
than turns toward the key, 
will continue to Occur, it is 
By this time, the pigeon’s 
them strikes the key. At 


apparatus can be arranged 
cally. 


this point, the experimenter can withdraw, because the 
So that subsequent pecks Operate the feeder automati- 
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explicitly stated. For example, reinfor 
responding efficiently than reinforc 
formulated so readily. For example, shapi 
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ing a large sidewise beak movement that finishes in front of the key. On the other 
hand, whatever the response and the organism, the experimenter should not miss an 
Opportunity to reinforce a response if it more closely approximates the response to 
be shaped than any response that had been reinforced before. We are not so much 
concerned with the details of the pigeon’s key peck as with the general properties 
of shaping, which may be applicable to the mastery of such varied skills as 
Bymnastics, handwriting, playing a musical instrument, and pronouncing the sounds 
Of a foreign language. 

As a procedure, shaping is based upon differential reinforcement: At successive 
Stages, some responses are reinforced and others are not. In addition, as responding 
changes, the criteria for differential reinforcement change, in successive approxima- 
tions to the response to be shaped. The property of behavior that makes the 
differential reinforcement of successive approximations to a response effective is 
that behavior is variable. No two responses are the same, and the reinforcement of 
One response produces a spectrum of responses each of which differs from the 
reinforced response along such dimensions as topography (form), force, magnitude, 
and direction. Of these responses, some will be closer to the response to be shaped 
than others and may therefore be selected next to be reinforced. Reinforcing these 
responses will in turn be followed by still other responses some of which may come 
even closer to the response to be shaped. Thus, reinforcement can be used to 
change the spectrum of responses until the response to be shaped occurs. 

This aspect of shaping, which depends on the relation of a reinforcer to the 
responding that produced it, is sometimes supplemented by another and more 


Beneral effect of reinforcers. Some reinforcers, such as food, often increase activity 
Ces, the delivery of food to a food-deprived pigeon makes the pigeon active; it 1s 
force the holding of a posture: cf. Blough, 


therefore di a 
ifficult to use food to rein 
1958). A response more closely approximating the response to be shaped than রঃ 
One most recently reinforced may occur not only because reinforcement a ্র 
Such responses more likely but also because some reinforcer deliveries make 
Organism more active. - ক 
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acknowledged this problem in chapter 2 but did not ir Ae 
Will see that we must treat not single responses but rather i 


OPERANTS: DIFFERENTIATION 
AND INDUCTION 

If we watched a rat’s lever presses, we might see the rat press the lever with কং 
Paw or with both paws or by sitting on it Or perhaps even by biting it. Eac f a 
different response, and even if two presses were made with the same paw they 
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would not be identical. Nevertheless, we call all of these responses lever presses. 
On the other hand, the rat might make similar movements at the other end of its 
chamber, distant from the lever, but we would not call those responses lever presses 
no matter how closely they resembled the earlier responses that did operate the 
lever. 

It is not sufficient to speak of behavior only in terms of individual responses. 
Individual responses are instances of behavior and each can occur only once, 
because responses can have common properties but cannot be identical in all 
respects. The reinforcement of one response produces later responses resembling 
the reinforced response more or less closely, but the later responses cannot be 
exactly the same as the reinforced response. On the other hand, we cannot group all 
responses together without distinction, because we would be left with nothing to 
speak of but behavior in general. We must settle for an intermediate level of 
analysis, at which we speak neither of individual responses nor of behavior in 
general, but rather of classes 0f responses defined by common properties. 


Response classes 


In experiments on the rat’s lever press, the lever is attached to a switch that 
Operates whenever the rat moves the lever with sufficient force through a sufficient 
distance. The common property of all lever presses is this consequence of the press: 
each response that operates the switch qualifies as a lever press. Defining response 
classes in terms of common environmental effects is the basis both for measuring 
responses in the class and for imposing experimental operations on these responses. 
For example, an experimenter could measure lever presses by recording operations 
of the switch and arrange that all responses with this effect are reinforced with food 
pellets. 

But the response class established by the experimenter has no behavioral 
significance without taking into account the effects of the experimental operations 
imposed on this class. The experimenter must ask a fundamental behavioral 
question: can the consequences of responses in this class modify the likelihood of 
responses in this class? If so, the class may be called an operant class; an operant 
class is a class of responses affected by the way in which it Operates upon or works 
on the environment. Lever presses and key pecks are convenient examples, but 
more extensive and complex cases are included among operant classes. 

Early in the psychology of learning, when Operant behavior was called instru- 
mental or voluntary, it was assumed that only responses of the skeletal musculature 
could enter into classes modifiable by their consequences. Other classes of 
responses called autonomic, such as those of glands and smooth muscles, had not 
been shown to be modifiable by their consequences. Such responses were typically 
elicited, and Pavlov’s conditioning procedures (see chapters 2 and 9) had shown 


how new stimuli could come to elicit them. At that time, there was negligible 


evidence that these responses could be modified throu 


gh reinforcement or punish- 
ment. 
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by or RELA that autonomic responses could sometimes be modified 
ss sequences changed this view. Consider the salivary response. Salivation 
pontaneously as well as when elicited by a stimulus such as food in the 
mouth (e.g., Zener and McCurdy, 1939; cf. chapter 3). Consequences could be 
arranged for spontaneous or emitted salivation, as measured in drops of saliva 
brought out from a dog’s salivary duct to an external recording system. The 
consequences of salivation could not be food, however, because the effect of food 
AS a reinforcer would be hard to distinguish from the effect of food as an eliciter 
Of salivation. Water was therefore used to reinforce salivation in water-deprived 
dogs (N. E. Miller and Carmona, 1967); water does not elicit salivation. Salivation 
Increased when it produced water (reinforcement) and decreased when it prevented 
the delivery of water (punishment). Responses in this autonomic class, salivation, 
Were modified by their consequences, in other words, these autonomic responses 
could appropriately be called an operant class. 
Another example is illustrated in Figure 6-1. Because autonomic responses 
Can be modified by skeletal responses (as when physical exertion raises heart rate), 
and because they are assumed typically to vary together (as when excitement or 
relaxation jointly affect heart rate, breathing, and gastric secretion), a convincing 
demonstration of operant control over autonomic responses should eliminate 
Skeletal responding and should show the independence of different responses. This 


Was accomplished with groups of rats paralyzed with curare (N.E. Miller and 
an event known to reinforce such 


Banuazizi, 1968). Electrical brain stimulation, 

responses as lever pressing in the rat (Olds and Milner, 1954), was used as a reinfor- 
cer. This reinforcer was made contingent upon intestinal contraction or intestinal 
relaxation, and upon heart rate increases Or heart rate decreases. Figure 6-1 shows 
that these two classes of autonomic responses were independently affected by their 
reinforcers. Some questions have since been raised about whether these and similar 
Studies are reproducible (N. E. Miller, 1978), but the procedure nevertheless remains 


8 methodological tour de force in the differentiation of response classes. 
An operant is a response class that can be modified by the consequences of 


the responses in it. This definition of a response class depends on behavioral 
Properties of responding and not on independent physical or physiological 
Properties (such as the somatic-autonomic distinction). These behavioral properties 
are based on the operation called differential rein. prcement, the reinforcement only 
Of responses that fall within a specified class. This operation makes subsequent 
tesponding conform more and more closely to the defining properties of the class. 

he essential feature of an operant is the correspondence between a class of 
“esponses defined by its consequences and the spectrum of responses generated by 


these consequences. 
A hypothetical example of differential reinforcement 
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Gollub, 1966; Eckerman and Lanson, 1969). A series of photocells permits us to 
measure where the rat pokes its nose through the slot. We will label successive 
2-centimeter segments along the length of the slot as positions 1 through 15, 
reading from left to right. Food pellets can be delivered into a cup in the wall 
Opposite the slot. Pellet deliveries are accompanied by a distinctive noise, and the 
rat quickly comes to the cup and eats whenever a pellet is delivered. 

In the absence of reinforcement, the rat spends only a little time near the 
slot. Occasionally it sniffs at the slot and puts its nose in it as it moves about, but 
these responses are relatively infrequent and do not have any systematic relation to 
the positions along the slot. A frequency distribution of the rat’s responses as a 


function of position is shown in A of Figure 6-2. 
Now let us reinforce the rat’s poking of its 


the nose-poke occurs at positions 9, 10, Il or 1 fl 5 
ment, illustrated in B of Figure 6-2, is to increase responding at all positions; the 


effect is not restricted only to those positions correlated with reinforcement. This 
Phenomenon, the spread of the effect of reinforcement to other responses not 
included in the reinforced class, is called induction (an occasional synonym is 
response generalization). In the example, reinforcing responses at positions 9 
through 12 affected not only responding at those locations but also responding at 


0 Eh é 
ther positions across the entire slot. I RE 
As we continue differential reinforcement, reinforcing responses at positions 


9 through 12 but not at other positions, responding gradually increases at positions 
correlated with reinforcement whereas it decreases at the other positions. These 
effects are shown in C, D and E of Figure 6-2. Eventually, most responses are 
Within the boundaries that determine whether a response is to be reinforced, as in 
E,and a point may be reached at which, even though some unreinforced responses 
Still occur, the distribution of responses across positions does not change with 


Continued differential reinforcement. ME 
In the example, the distribution © emitted res S { 
closely to the bounies of the class of reinforced responses. This process is called 


differentiation, and responding established in this way is said to be differentiated. 
ifferential reinforcement established a response class defined in terms of response 
Position. Yet if the distribution Of responses in E represents the maximum 
differentiation possible with this procedure, what can be said about ka responses 
that continue at positions 6, 7 and 8 or at positions 13, 14 and 15? They are 
Outside the boundaries of the class of responses correlated with কাপল 
According to a strict interpretation of the defining properties of operants they 
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jo f the operant class. They can at least Dp 
লু জা ক “ are $0 close to the class of reinforced 


In ter i as sSDOnN i 
responses সৰা পট 5 Position that the effects of reinforcement have 
Spread to them from the reinforced class. This view simply attributes responding 
Within the boundaries of the reinforced class to differentiation and responding 
Outside these boundaries to induction. The same operation generates responding 
Outside as well as inside these boundaries, and this responding can be represented 


as Fl 2" CY 
One continuous distribution. 
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ES Es oo Es that this example involves two 
Deon Fe SE a ্ ন: e রং or the CDETaLOn of reinforcement and 
(Sod hed vertical lines in ‘igure 6-2; these lines show how stimuli 
ED s) epend upon response position. The lines may be thought of as 
OAD a distribution of stimulus probabilities as a function of position; the 
2 ility of a pellet is 1 .0 following responses in positions 9 through 12 and is 
ro elsewhere. This distribution defines a class of responses in terms of the conse- 
EL of responses within that class. The second class of responses is given by the 
re al performance resulting from the reinforcement operation. This class is 
presented at any time by the current distribution of responses (e.8., early in the 
course of differential reinforcement, as at C, or late, as at E). The two classes need 
ECTS exactly. In fact, the degree of correspondence between the behavior 
find is reinforced and the behavior generated by this reinforcement is a 
amental dimension of any class of reinforced responses. 

Let us now consider another hypothetical example, illustrated in Figure 6-3 

(ef Herrick, 1964). Again we use photocells to record the positions at which the 
rat pokes its nose through a slot, but this time we use a vertical rather than a 
horizontal slot. The fifteen positions are numbered consecutively reading from the 
bottom to the top. Before reinforcement begins, the rat occasionally puts its nose 
in the slot as it sniffs about the chamber, but these responses occur predominantly 
at the lower positions, as in A of Figure 6-3. Perhaps a response would eventually 
Occur at position 15, at the top of the slot, if we were patient enough. But if we are 
Interested in differentiating responding at the upper positions of the slot, shaping is 
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Figure 6-3. Hypothetical Tesponse distributions illustrating the effects on response 
location of differential reinforcement of Successive approximations to a response 
(Shaping). The situation is Similar to that in Figure 6 
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15, but with the maximum at position 12). Responding was obviously modified by 
ins consequences, but some of this responding was outside the boundaries of the 
criterion for reinforcement. Thus, the operant class must be defined in terms of 
the relation between the environment (the consequences that it arranges for 
responses) and behavior (the responding produced by these consequences). For 
convenience, we will occasionally speak of an operant solely in terms of a class 
defined by consequences or solely in terms of the distribution generated by these 
consequences, but it is important to remember that the operant is more strictly 
defined in terms of the correspondence between these two classes (cf. Catania, 


19731). 


Differentiable properties of behavior 
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h dimensions as force of a lever 


Reinforcement inevitably incl 
Presses and key pecks must occur at a par 
Or they will not produce the consequences arrange 
examples of Figures 6-2 and 6-3 by substituting suc 
Press or a key peck. Responding can vary not only in its location or its force but 
also in its topography or form, its duration, its direction, and so on. Thus, because 
differential reinforcement can be based on any dimension of responding, any 
dimension may provide the defining properties of an operant class. | 

Figure 6-4 shows a case of response differentiation in a human (Hefferline 
and Keenan, 1963). The response was an invisibly small twitch of the thumb, 
measured in microvolts of muscle contraction. The reinforcer for thumb 
Contractions in the 25 to 30 microvolt range Was the addition of counts each worth 
a nickel to a visual display. Relative to baseline responding, in which thumb 
Contractions in the reinforced range were infrequent, responses in the reinforced 
class interval together with those in the adjoining class intervals increased whereas 
responses in the 10 to 20 microvolt range decreased. Induction to higher voltage 
Contractions was also evident. In extinction, the form of the distributor began to 
return toward that of the baseline distribution. For each participant this procedure 


differenti ne ed with reinforcement, but 
entiated thumb contractions 1 ্‌ 

none was able to report on the thumb twitching or on what had produced the 
reinforcers. This experiment has been cited to defend th 
Of reinforcement do not depend on an organisms awaren 
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Tne 65; Latency distributions of Pigeon | 
Nd were timed from the onset of the key light, and 
ob food if its latency exceeded some criterion value. 

tained at each of several latency criteria, ranging from 0 


ar j a ি্‌ 
eas represent unreinforced pecking and unfilled areas repres 


seconds or less, the peak of the 
Jue longer than the criterion latency and more pecks 


Were reinforced than unreinforced (distributions at the front left). When the 
EO latency became longer, however, relatively fewer pecks were long enough 
0 qualify for reinforcement, and at a criterion of 48 seconds only a very small 


Proportion of pecks was reinforced (rear distribution). } 
The tendency for responding to exceed the reinforcement criterion at short 
latencies and to fall below it at long latencies is a common feature of many 
Performances generated by differential reinforcement of temporal properties of 
behavior, including performances of humans as well as those of pigeons and rats 
(Catania, 1970). This may depend in part on the typical asymmetry of such 
Procedures. The procedure places a lower limit but no upper limit on reinforced 
latencies; latencies well above criterion at short minimum reinforced latencies 
Would presumably be shorter if an upper as well as a lower limit were imposed (e.£., 
stead of reinforcing any peck with a latency of two seconds or more, We might 
reinforce only pecks with latencies between two and three seconds). 
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einforcement. When the criterion latency was ten 
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havior is relevant to schedules 


al properties of be 
We will see that substantial 


The differentiation of tempor 
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quantities of behavior can be maintained even when only an occasional response 
produces a reinforcer. In reinforcement schedules, responses can be made eligible 
for reinforcement on the basis of the number of responses already emitted, the time 
elapsed since some event, or some combination of these conditions. A property of 
particular interest is the separation of responses in time. For example, one will be 
successful in starting a car with a flooded engine only if one waits long enough 
after the last attempt to start it; conversely, one will be successful in inflating a 
bicycle tire with a pump that has a leaky connection to the valve only if one pumps 
rapidly enough. Analogously, reinforcement might be arranged for a pigeon’s key 
peck only if that peck had been preceded by ten seconds of no pecking, or only if 
pecking had been emitted so Tapidly that at least twenty pecks had occurred within 
the last five seconds. In the first Case, pecking decreases; in the second, it increases. 
Yet to concentrate solely on the pecks might lead us to say that the first case was 
not an instance of reinforcement. In fact, the unit of responding reinforced in the 
first case was not a peck but rather a sequence including a pause plus a peck.. To 
the extent that this combination increased in freqnency (thereby demonstrating 
that the reinforcers had affected the response class that produced them), the rate of 
pecking necessarily decreased. 

The procedure in the first example, in which a response is reinforced only if 
preceded by a minimum time without a response, is called a differential reinforce- 
ment of low rate or DRL schedule (Ferster and Skinner, 1957). The procedure is 
Sometimes also called a schedule of interresponse time or IRT reinforcement 
(Malott and Cumming, 1964), because reinforcement is based on the spacing in 
time of individual responses rather than on the average rate generated by many 
Tesponses occurring over an extended time. In general, the longer the interresponse 
time or IRT required for Teinforcement, the lower the rate of responding (Wilson 


and Keller, 1953; Staddon, 1965). In DRL performance, the rate of responding 
decreases because the likelihood of res 


increases. Thus, in DRL 
complex operant consist 
In the second example, i 


» which leads again to a decrease in reinforcer 
€ may end in a rate of pecking so low that the reinforce- 


ment criterion is never met, and the Pigeon’s pecking may cease completely. To 
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reinstate responding, the experimenter would have to establish the high-rate 


performance again through shaping. 
In the DRL schedule, however, an effect of reinforcers following pecks is that 


pecks occur more often. This higher rate of pecking means that pecks occur closer 
a time, and therefore fewer follow IRTs long enough to meet the reinforcement 
criterion. Thus, the delivery of reinforcers decreases, and the rate of pecking 
decreases in turn. But the decrease in rate simply makes IRTs long enough to meet 
the reinforcement criterion more likely. Thus, pecking oscillates between increased 
rates accompanied by decreased reinforcement and decreased rates accompanied by 
increased reinforcement. Pecking is maintained over long periods of time by such 


contingencies, allowing differential reinforcement gradually to affect IRTSs. 
The significance of these examples is that we must be cautious about taking 
rate of responding as f the effects of reinforcement. 


a fundamental measure ) 
Response rate was once regarded a f response strength (cf. reflex 


s a measure 0 
strength in chapter 3; Nevin, 1974), but with the recognition that it was simply a 
property of behavior that cou 


ld be differentiated like other properties such as force 
and topography, this view became less tenable. Although DRL responding occurs at 
a low rate, this responding is € 


asily established and maintained; and although DRH 
responding occurs at a high rate, this responding is hard to establish and fragile once 
established. 


Various other classes of diffe 
differential reinforcement of pace 
lower limit on the IRTs that can precede 


generate a fairly constant and continuing resp 
ment of other behavior or of zero behavior ( 


reinforcement schedules exist, such as the 
d responding, which sets both an upper anda 
a reinforced response and which tends to 
onse rate. The differential reinforce- 
usually abbreviated DRO) delivers a 


reinforcer if a specified time elapses without a designated response. Consider, for 
example, a DRO schedule of ten seconds. This schedule reinforces the class of all 
responses consisting of ten seconds with no pecking. But if a reinforcer is delivered 
after every ten seconds without a peck, then each peck will delay the reinforcer by 
at least ten seconds. If pecks then occur less often because they prevent or delay 
reinforcers, it would be appropriate to say pecks were negatively punished. These 
ambiguities were considered in chapter 5. in which we considered whether it was 
worthwhile to distinguish between the positive reinforcement of not-responding 
and the negative punishment of responding. We concluded there that speaking in 
terms of discrete responses Was less likely to mislead than speaking in terms of the 
absence of those responses. In name, the DRO schedule seems to raise similar 
questions, but perhaps it will be less troublesome simply because it involves specify- 


ing a procedure rather than attributing functional properties to such events as not- 
responding. 


rential- 


Complex behavior: Maze learning 


So far, we have spoken mostly of relatively simple responses, such as the rat’s 
k. But our account of differentiation indicates 


lever press or the pigeon’s key pec 
that complex sequences of responses can also be treated as operant classes. For 
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example, a rat’s negotiation of a maze might be regarded as a single Fr I 
complex response. Consider the simplest maze, the T-maze (Figure 4-2). I ত 
goalbox is empty and the right goalbox contains food, a response sequence ha 
includes running from startbox to choice point, turning right and then running to 
the right goalbox will be reinforced. Sequences that include turning left will not be 
reinforced or (if the rat is allowed to retrace its way from left to right goalbox) will 
at least not lead to the reinforcer as quickly. As the rat comes more and more 
frequently to choose the right path we may say that this T-maze performance has 
become differentiated. 

But maze learning can involve more complex sequences. Consider the differ- 
entiation of sequences of responses in the double-alternation problem (e.g., Hunter, 
1928). At one end of a central runway, a choice point provided an opportunity 
for a left turn (L) or a right turn (R), but both paths led back to the other end of 
the runway. Without being removed from the apparatus the organism could come 
back up the central runway and again choose left or right at the choice point. After 
a series of right and left turns in the appropriate order, the experimenter could 
allow the organism to find food as it returned to the central runway. In this 
apparatus, the question was whether Organisms could learn a double alternation 
Sequence, either LLRR or RRLL. One function of these experiments was to 
compare the behavior of different Species, and in fact the sequence was learned 
with varying degrees of success by different species and by different organisms of 


the same species. A more general issue was whether the organism's behavior at one 
time could determine its behavior at 


a double-alternation sequence succes: 
Somehow took into account the tur! 


components (e.g., if there are more R’s than L’s in learnin 
because only R is ever followed immedi 


Y complex response 

sequences (e.g., Fitts, 1964; Bilodeau, 1969). 
As an organism learns response s 
tial reinforcement acts upon the b i 
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5 er Ina discrimination task with light and dark alleys, in which light-alley 
EY EE a maze in which correct choices were correlated with 
Tate) se ), a rat at first might mostly choose the dark alley, then 
Ont iy t alley, and then mostly the right alley, until finally it began 
Rr ntly choosing only the light alley. These systematic patterns were noted by 

echevsky (1932), who called them hypotheses and discussed their implications 


as follows: 


Almost every description of animal learning includes such phrases as “random 
exploratory movements”; “chance entrances”; “chance errors.” It is either 
implicitly assumed or explicitly stated in most descriptions of learning that in 
the beginning of the process the animal is a ‘chance’ animal. His responses 
are without purpose, without form, and without meaning. Helter-skelter trial 
and error seems to be 


the rule at first, and then, after such behavio 
eventually led the animal to ex 


| k perience the “correct” pattern, the various laws 
of learning step in to stamp in the correct responses and stamp out the 
incorrect. (Krechevsky, 1932; p: 157) 


He then contrasted this view with a different one, based upon his data: 
The animal... brings to each new situation a whole history of experiences. 
These experiences the animal is ready to apply. From the very beginning, 

ightforward, com- 


perhaps, the animal ing his problem in a Strail 
prehensive manner i a meaningful part of his total 


wherein each response Ban 
behavior. The animal, in executing f movements which we call 


a series 0. 
“perfect,” “errorless,” “jearned,” “in hing which 


tegrated,” is not doing somet 
has arisen from a series of “mperfect,” “unintegrated,” “chance” responses. 
ing through a dif 


He is now merely runnin ferent set of integrated responses, which 
series of integrate. by other just as integrated 
responses. Such responses, “early systematic attempts," 
etc., we have dubbed with t of “hypotheses.” . . . When the 
human individual behaves in th ch and such way We must also 
say that he has an “hypothesis.” However, We are primarily interested not in 
defending our terminology but in describing certain behavior. The term “hy- 
pothesis” has merely been chosen as a convenient tag for such behavior. 


(Krechevsky, 1932, PP- 528-29; cf. Levine, 1966) 


he dubious name 
e very Same Su 


Response sequences: Chaining versus temporally extended units 

Once we break down a behavior sequence into components, we may begin to 
treat the sequence as 8 succession of different operants, each defined by the 
reinforcing consequence of producing an opportunity to engage in the next until 
the sequence is terminated by a reinforcer. Such a sequence is called a response 
chain. An example was provided, in chapter 4, by Sidman’s detailed analysis ofa 
rat’s lever pressing reinforced by food. Rising up to the lever produced contact with 
the lever, which set the occasion for pressing the lever, which produced a seen food 
pellet, which set the occasion for moving to the food cup, and so on. At each stage, 
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behavior is consequential, and any given segment serves the dual function of 
reinforcing the last response and producing the conditions that occasion the next 
one. 

Some behavior sequences reasonably seem to be reducible to smaller units in 
this way, and an analysis into such components can be confirmed experimentally 
simply by seeing how independent the components are from each other (cf. 
Skinner, 1934b). For example, if pressing the lever no longer produces a food 
pellet, we may find that lever pressing decreases, but we may also find, by 
delivering food pellets independently of lever pressing, that a seen food pellet 
continues to occasion movement to the food cup. This procedure demonstrates 
that the integrity of one component is not affected by altering the reinforcement 
contingencies for another. We could also find out about components by 
determining whether substitutions can be made. For example, we might eliminate 
the seen food pellet by conducting the procedure in darkness and find that the 
sound of a food pellet dropping into the food cup could take over the former 
function of the seen food pellet. 

The critical question is not so much whether some behavior sequences are 
held together in this way, but rather whether this is the only basis for the develop- 
ment of behavior sequences. In the history of the psychology of learning, the 
positions taken on this issue were often symptomatic of serious divisions among 
researchers of different orientations, some holding that sequential behavior could 
always be interpreted in terms of the concatenation of such components (variously 
called associations, chains, S-R or stimulus-response bonds, or conditioned reflexes, 
among others), and others holding that sequential behavior could not be interpreted 
adequately in such terms. We will conclude that sequential behavior of both sorts is 
possible; the significant experimental question is to determine the nature of the 
serial ordering in any particular case. 


Perhaps the most telling argument was made by Lashley (1951), who 
summarized earlier conceptions as follows: 


+. the only strictly physiological theory that has been explicitly formulated 
to account for temporal integration is that which postulates chains of 
reflexes, in which the performance of each element of the series provides 
excitation of the next. This conception underlay the “motor theories” of 
thinking which... sought to identify thought with inaudible movements of 
the vocal organs, linked together in associative chains. The peripheral chain 
theory of language... described speech as a succession of vocal acts in which 


the kinesthetic impulses from each movement serve as a unique stimulus for 
the next in the series. (Lashley, 1951, p. 114) 


Lashley then described cases, including illustrations from both language and music, 
that made such accounts implausible. 

For example, in answer to the argument that each movement serves as a 
unique stimulus for the next, Lashley considered the complex sequence of 
movements required to pronounce the sounds of the word “Tight” in proper order. 
The order is not given by the sounds themselves, because the sounds can occur in a 
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variety of orders and combinations (e.g., in the opposite order, as in “tire”). Thus 
the sound sequence cannot be based solely on direct connections, but must degen 
on some larger organization. Lashley extended the case from sequences of sounds 
within words to sequences of words within sentences: 


The word “right,” for example, is a noun, adjective, adverb, and verb, and has 
four spellings and at least ten meanings. In such a sentence as “The mill- 
wright on my right thinks it right that some conventional rite should 
symbolize the right of every man to write as he pleases,” word arrangement 
is obviously not due to any direct associations of the word “right” itself with 
other words, but to meanings which are determined by some broader rela- 


tions. (Lashley, 1951, PP. 115-16) 


ut that the sheer rapidity of some 


And Lashley used music to point 0 
ht have been generated: 


sequences constrained how the sequences mig 


ian may reach sixteen per second in passages 
which call for a definite and changing order of successive finger movements. 
The succession of movements is too quick even for Visual reaction time. In 
rapid sight reading it is impossible to read the individual notes of an arpeggio. 
The notes must be seen in groups... Sensory control of movement seems to 
be ruled out in such acts. (Lashley, 1951, p. 123) 


The finger strokes of a music. 


nt, then, was that some sequential patterns of responding 
succession of stimulus-response or S-R units. When a skilled 
the, these letters cannot be discriminative stimuli for 


the next stroke, first because the typist will be executing that next stroke even 
before the typed letters on the page can have any stimulus effects, and second 
because these letters cannot be a unique discriminative stimulus if they can 
occasion hitting the space bar or various other keys depending on whether the 
typist is typing the word the or these or then Or thermometer. 

The historical problem was probably that, in the face of such arguments, 
researchers felt that a choice had been forced between assuming that sequential 
behavior depended upon stimulus-response sequences and assuming that it 
depended on temporally extended units of behavior not reducible to such 
Sequences. Instead, we may regard the issue as an experimental one. Clearly some 
sequences can be put together in such a way that each response produces stimulus 
conditions that set the occasion for the next response, whereas others must be 
integrated in such a way that the responses appear in the proper order without each 
depending on the consequences of the last. For any given behavior sequence the 


issue is deciding which type of sequence itis. 


Lashley’s argume 
cannot be reduced to a 
typist rapidly types the letters 


Limits on differentiation 
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Sometimes the basis for such differences raises no 
e, we do not expect flight to be the same in 


Some properties of res 
reinforcement than others. 
theoretical questions. For exampl 
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pigeons and bats and bees, and we ordinarily do not even ask about the possibility 
of flight in the rat. The differences are obviously anatomical, and they illustrate 
how much we take limitations on an organism’s motor or sensory capacities for 
granted. But a difference in the effect of differential reinforcement on different 
responses is more likely to raise questions when it has no clear anatomical basis. 

For example, like motor capacities the capacities of various stimuli to 
reinforce differ across species. Before the relativity of reinforcement was 
recognized, findings in which established reinforcers for a species failed to have 
their characteristic effects were hard to deal with. Breland and Breland (1961) 
described several cases and used them to argue against the generality of 
reinforcement as a behavioral process. In one demonstration with raccoons, food 
reinforcers were delivered when a raccoon picked up coins from the ground and 
deposited them in a container. After some repetitions of the procedure, the raccoon 
began persistently to rub the coins together instead of releasing them into the 
container. Contrary to the Brelands’ argument, this finding did not invalidate the 
Principle of reinforcement. A raccoon ordinarily rubs and washes its food before 
eating it, and the coins apparently provided a better opportunity for this behavior 
than the food that was supposed to function as a reinforcer. The demonstration 
showed that rubbing was sufficiently more probable than eating that food was not 
effective as a reinforcer; and we might also assume that an opportunity for rubbing 
could have been effective as a reinforcer for other responses, perhaps including 
eating. 

Demonstrations of limitations on properties of behavior that can be differen- 
tiated do not invalidate our analysis, because our account has been mainly 
concerned with naming phenomena and relating them to one another. When an 
experimental procedure is applied, there is no guarantee that the procedure will be 
effective. A procedure that leads to learning with one response or one organism 
may not do so with a different response or a different organism, and a response or 
an Organism affected by one procedure may be unaffected by other procedures. 
No single process accounts exhaustively for the phenomena of learning, and part of 


A of an analysis of learning is to explore the limits on the processes that we 
study. 


SUMMARY AND EXTENSION: 
THE STRUCTURE OF BEHAVIOR 


We have seen how behavior can be changed through shaping and how differential 
reinforcement is involved in generating response classes. The close correspondence 
between the class of responses with consequences and the class of responses 
generated by these consequences is the criterion for speaking of an operant class. 
These classes may be defined along single dimensions such as force or location, or 
may have more complex Properties. Our examples included differential-reinforce- 
ment schedules and integrated response sequences. Questions about differentiable 
properties of responding are concerned with the structure of behavior. When we 
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describe such properties, we must include how they are measured; the measuring 
techniques may vary from one response to another. 

In each case, our major interest is with the dimensions along which 
responding conforms to the class of responses that is reinforced. The structure of 
behavior is such that we cannot always define such dimensions independently of 
the reinforcement contingencies. For example, consider the reinforcement of novel 
porpoise (Pryor, Haag, and O’Reilly, 1969). These 
investigators produced novel performances by selecting for reinforcement, in each 
experimental session, some class of responses not reinforced in any previous 
session. After several sessions, the porpoise began to emit responses, in each new 
session, that the experimenters had never seen before. Response novelty had been 
differentiated, but how else is this operant class to be specified except by describ- 
ing the criteria for reinforcement? The fact that we have difficulty measuring them 
does not rule out originality or other complex dimensions of behavior as properties 
that may define an operant class. Even the variability of responding itself can be a 
basis for differential reinforcement (Schoenfeld, Harris, and Farmer, 1966). 

We will return to these issues when we discuss verbal behavior. Sentences can 
be differentiated into grammatical classes such as active versus passive voice, Or 
declarative versus interrogative. Verbal behavior is consequential behavior, the 
finding that some grammatical structures are more easily Jearned than fs £ 
similar in kind to the finding that rats learn single-alternation OE ম ) 
more easily than double-alternation sequences (LLRR). In each Ed hE 
is to identify the dimensions along which responding may 5 2 Ea SR 
class of responses that has consequences. For ‘any PATUC ar en 0 tre 
classes may be more easily established along some dimensions El 
These dimensions may be simple or complex, intensive Co iE 
measure or difficult. Perhaps EAE ৰ ed 0 earlier 
description (e.g., the class of all responses not seen or no A OL 
Occasions, as in reinforcing the porpoise’s novel reponse Hm a fineHonal 
dimensions is a structural concern, and must be TIE ie ib he differential 
Concern, such as an analysis of the changes in behavior চা দা I ts ECE 
Consequences of responses along these dimensions. We iscuss EE 
function of behavior in chapter 1. The present IS 5 2 te 
these two properties of behavior are not incompatible; {hey je) 
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The words differentiation and induction applied to responses classes are 


paralleled by discrimination and generalization applied to stimulus classes. 


These words have no obvious etymological ties. Differentiation, from the 
d through the Indo-European 


Latin dis-, apart, plus ferre, to Carty, is relate. 
bher-, to carry or bear, to such words as birth, transfer, and metaphor (but 
in, plus ducere, to lead, is 


not interfere). Induction, from the Latin in-, 
related through the Indo-European deuk-, to lead, to such words as duke, 


educate and conduct. Discrimination, from the Latin dis-, plus crimen, judg- 
ment, is related through the Indo-European skeri-, to cut or separate, to such 
Words as crime, describe, and criterion. And generalization, from the Latin 
genere, to produce or cause, is related through the Indo-European gen-, to 


give birth or beget, to such words as ingenious, kind, and nature. 


ly with respect to response 
t to the dimensions of stimuli in the presence of 


Which responses occur. For example, a rat's lever presses in light are different 
its presses in darkness, and reinforcement can be arranged for lever presses a 
Presence but not the absence of light. Similarly, a pigeon’s key pecks in ত be 
are different from its pecks in red light. If or Fe nn nt 5 a 
Say that rei key pecks is correla - : 
বস তপ "টল Reinforcement with respect to stimulus properties 
is called a discriminated operant. | on 
Discriminated operants are 8 pervasive aspect of SBE pd Mires 
Proceed through an intersection if the traffic light is green but 1! re A 
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that failed to discriminate the appropriate place at jo 2 টা ld লী 
Would repeatedly bump the walls as it moved through the maze, ly 


master the maze as a whole. 


Responses can be differentially reinforced not On. 


dimensions, but also with respec 
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Discriminative stimuli correspond to the stimuli colloquially called signals or 
cues. They do not elicit responses. Rather, they set the occasion on which responses 
have consequences, and are said to occasion or evoke responses. An example of the 
development of stimulus control, or of the control of responding by a 
discriminative stimulus, is shown in Figure 7-1 (Herrick, Myers, and Korotkin, 
1959). Rats pressed a lever in the alternating presence and absence of light from a 
small lamp. When the light was on, lever presses were Occasionally reinforced with 
food. When the light was off, lever presses were not reinforced. The notation for 
the stimulus correlated with reinforcement is SD for discriminative stimulus or S+ 
for positive stimulus; the notation for the stimulus correlated with nonreinforce- 
ment or extinction is SA, also for discriminative stimulus (A is delta, the Greek 
letter d), or S- for negative stimulus (strictly, So would be more appropriate, 


because nothing is removed in the presence of this stimulus, but S- has become the 
more common usage). 


Herrick, Myers & Korotkin (1959) 
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In the procedure of Figure 7-1, light and dark alternated irregularly; when the 
lamp was lit, it remained on for varying periods ranging from five to thirty seconds. 
In addition, lever presses were reinforced according to a variable-interval or VI 
schedule of reinforcement: on the average, only one lever press per thirty seconds 
was reinforced when the light was on. For the present, the important features of 
this schedule are that it maintains responding and that the varying times between 
successive reinforcers make the time since the last reinforcer an unreliable predictor 
of when the next response can be reinforced. In these circumstances, if the rat 
presses more in the light than in the dark the light may be regarded as the 
controlling stimulus; such a rate difference could not depend on the time between 
stimulus changes or the time between reinforcers because the temporal cycle of 


these events was unsystematic. 
Figure 7-1 shows that over sessions l 
its absence it decreased. The increase is a 


ination procedures. The figure also shows chan 
responding in the light as a percentage of total responses. The index increased over 


sessions. This outcome may be stated in many ways: lever pressing in the presence 
of light was a discriminated operant; the light functioned as a discriminative 
stimulus for lever pressing; Or, lever pressing was under the stimulus control of the 


light. 


ever pressing in light increased whereas in 
frequent accompaniment of such discrim- 
ges in a discrimination index, 


THE NATURE OF DISCRIMINATED 

OPERANTS 

d operants with a hypothetical 
e 6-2. Again we have a rat in a chamber with a slot 


in one wall, but this time the rat cannot poke its nose into the slot. Instead, the slot 
is covered by translucent plastic, and a series of fifteen lamps behind the plastic 
can illuminate successive segments of the slot. In other words, the lamps provide a 
stimulus dimension the components of which are analogous to those of the 
response dimension of Figure 6-2. Centered beneath the slot isa lever, and the food 
cup of a pellet dispenser is on the opposite wall. Let us now light the lamps behind 
the strip for fixed durations but in irregular order and record the lever pressing that 
Occurs in the presence of each. If we do not reinforce lever presses, we can expect 
pressing to be infrequent, and we would not expect the presses to bear any 
systematic relation to the portion of the strip that is lit. In fact, we might expect 
data much like those in A of Figure 6-2, with the main ference that the x-axis 
Would represent stimulus position instead of response position. at, j 

At this point, we begin to reinforce lever presses only if the slit is lit at posi- 
tions 9, 10, 11 or 12; presses are not reinforced when a light comes on in any other 
position. The initial effect 0 t is much like that shown in B of Figure 
6-2: an increase in responding tions. The effect of reinforcement 
is not restricted only to stimu lated with reinforcement; instead, 


We can illustrate some characteristics of discriminate 


example related to that of Figur 


f reinforcemen 
to lights at all posi 
li at positions corre 
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the effect of reinforcement at some positions spreads to other positions. This 
Phenomenon, the spread of the effect of reinforcement in the presence of one 
stimulus to other stimuli not correlated with reinforcement, is called generalization. 
In the example, reinforcing responding in the presence of light at positions 9 
through 12 affected responding in the presence not only of those lights but also 
lights at all other positions across the entire slot. The main difference between this 
example and the original example is in the dimension correlated with reinforce- 
ment: in chapter 6 we dealt with a response dimension whereas we are now dealing 
with a stimulus dimension. 

Suppose we continue differential reinforcement with respect to stimulus 
location, reinforcing responses in the presence of lights at positions 9 through 12 
but not at other positions. Responding gradually increases with lights at positions 
correlated with reinforcement, and decreases with lights at other positions. Again 
the effects are like those of Figure 6-2,inC, D and E. Eventually most responses 
occur with lights at positions correlated with reinforcement, as in E, and a point 
may be reached at which, even though some responses still occur with lights at 
other positions, the distribution of responses does not change much with continued 
differential reinforcement. 

In this example, the stimuli in the presence of which responses were emitted 
came to conform closely to the class of stimuli correlated with reinforcement. This 
process is called discrimination, and responding that has come under the control 
of stimuli in this Way is said to be discriminated. Differential reinforcement with 
respect to stimuli established a response class defined in terms of the stimuli in the 
Presence of which responses occur. But what about responses in the presence of 
stimuli outside the boundaries Correlated with reinforcement (e.g., positions 6, 7 
and 8 or positions 13, 14 and 15)? According to a strict interpretation, they cannot 
be counted as members of the discriminated-operant class; they can be spoken of in 
terms of generalization. Yet the same Operation of differential reinforcement 


Benerates responding inside and outside these boundaries, and this responding is 
one continuous distribution. 


» We 
nt classes, but aside from arranging contingencies there 
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is not much else we can do about them. If we see a rat about to poke its nose into 
position 7 along the slot, we cannot appropriately prevent the response even if the 
rat has responded many more times at that position than at other positions. 
Suppose, however, that we were working with stimulus properties. We could 
present lights equally often at each position, or we could present lights at some 
positions but not others so that the rat would never have an opportunity to press 
the lever in the presence of some stimuli. Such cases justify saying that the stimuli 
in discrimination procedures set the occasion for responses: when a class of 
responses is defined by the presence of a stimulus, responses in this class cannot 
occur when the stimulus is absent. 
Even this methodological distinction has exceptions. For example, consider 
the differential reinforcement of interresponse times (e.g., the DRL schedule; 
chapter 6). If a pigeon’s key peck is reinforced only after at least five seconds of no 
pecking, the pigeon may come to space its pecks about five seconds apart. We 
discussed this behavior in terms of the differentiation of a complex operant consist- 
ing of a pause plus a peck. We could instead treat the duration of the pause as a 
stimulus property and argue that the behavior should be dealt with as discrimina- 
tion with respect to time elapsed since the last peck. In fact, in this particular 
instance the vocabularies are interchangeable; such ambiguities are always possible 
When the properties of responses make it easy to speak of the organism 
discriminating some property of its own past behavior. Whether we speak 0 
differentiation and induction or of discrimination and generalization, the 


i i i i tiation 
underlying operation in each case is differential reinforcement. Both differen 
yori inet ce between the dimensions upon 


and discrimination reduce to the corresponden ) 
Which Greeonad reinforcement is based and the dimensions of the Fe 
behavior, We will nevertheless maintain the established difference Lo 
Vocabularies of response properties and stimulus properties, because es 
Vocabularies have an extensive and widely accepted historical foundation. 


The problem of attention 


pondence between the stimuli with which reinforce- 
nd the stimuli to which the organism responds, 
dimension selected by the experimenter. But 
e no guarantees that the organism will 


respond t h ropertie entiation, a rat’s lever presses 
nd to j ose dey 
ny Ee Ne 5 constant form even though only force or location is the 


liably press the lever 
basis of di tial reinforcement. For example, the rat may re 
with its ey even though this property is not Ee SA 
will be reinforced. Similarly, in discrimination 4 rat migh a STERIC 
the intensity of a visual stimulus even though en Ee ae eel 
Only on its location or shape. (Stimulus DIOL ie k but ‘ TOpert of 
to respond discriminatively are sometimes called sare, St) 


In discussing the corres 
ment contingencies are correlated a 
We spoke in terms of the stimulus 


Stimuli h: j erties, and there are n° 
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salience is based on the organism’s behavior with respect to the stimulus rather than 
on its physical dimensions.) 

The concept of attention arises from discriminated responding to the proper- 
ties of stimuli (cf. Egeth, 1967; Mackintosh, 1965). Consider a Pigeon whose key 
Pecks are occasionally reinforced with food. One of two stimulus combinations is 
presented on the key: a triangle on a red background or a circle on a green 
background. Each stimulus is presented for three minutes. After three minutes of 
triangle-on-red, the next key peck in the presence of this stimulus is reinforced; 
after three minutes of circle-on-green, the stimulus turns off without reinforcement. 
This arrangement in the presence of triangle-on-red is called a fixed-interval or FI 
schedule of reinforcement; the arrangement in the presence of circle-on-green is 
extinction. We will examine the fixed-interval schedule in chapter 8. For the 
Present, it is sufficient to note that this schedule usually maintains responding 
that becomes more and more Probable as time passes in the interval, rather than the 
relatively constant rate of responding ordinarily maintained by the variable-interval 
schedule. If every peck in the presence of triangle 
then the reinforcer deliveries themselves might acquire discriminative functions. 
With FI reinforcement we do not have to Worry about such effects of reinforcer 
deliveries, because no peck can be reinforced until the interval has ended. 

Data from this procedure for two Pigeons are shown in Figure 7-2 (G. 5S. 
Reynolds, 1961b). The graphs on the left show rates of pecking in the presence of 
each stimulus compound at the end of eighteen hours of training. Both pigeons 
emitted more than forty pecks per minute during triangle-on-red, but pecked at 
relatively low rates during circle-on-green. In a test without reinforcement, each 
component of the compound stimuli was presented separately, with results shown 


-on-red produced a reinforcer, 


» On the other hand, almost 
; even though the triangle had been 
by itself this stimulus occasioned 


ed attention is not primarily an instance of 


generalization. For Pigeon 105, for example, responding did generalize from 


triangle-on-red to triangle without 
to red without a triangle. We do n 


Reynolds (1961) 
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Figure 7-2. Key pecking of two pigeons during reinforcement correlated with 
triangle-on-red and extinction correlated with circle-on-green (eft: training), and 
during extinction tests in which the components of these stimuli were presented 


separately (right: attention tests). The tests show that Pigeon 105 was responding 
ut not color whereas Pigeon 107 was responding to color 


discriminatively to form b 5 
but not form. (From G. S. Reynolds, 1961b, Figure 1.) 


stimulus dimension but not to another, it is appropriate to say that the organism 

is attending to the first dimension but not the second. h I 
Once an organism has attended to particular stimulus properties in one 
situation, it is also likely to attend to those properties in a new situation (e.g., 
also expect that we could change the likelihood 


Lawrence, 1949, 1950). We might k i 
that an organism would attend to one stimulus property or another simply by 


changing the way in which reinforcement is correlated with these properties. In the 
training conditions of Figure 7-2, form and color were correlated equally well with 
reinforcement. For this reason, the experiment was particularly suitable for a 
demonstration of some properties of attention. But if form discrimination in the 
Pigeon were our major interest, we could correlate stimuli along this dimension 
with reinforcement and make color irrelevant to reinforcement. 
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Place-learning versus response-learning. The preceding issues are indirectly 
related to a long-standing controversy in Psychology, on place-learning versus 
response-learning (e.g., Restle, 1957). We can differentiate the right turns of a rat at 
the choice point of a T-maze by reinforcing right turns but not left turns. We may 
then ask whether the rat’s responding should be characterized by response dimen- 
sions (movements to the right as opposed to movements to the left) or by stimulus 
dimensions (movements toward a particular place, without regard to the direction 
from which the rat approaches). For example, suppose the right arm of the T-maze 
pointed toward the windowed east wall of a laboratory. The rat might learn right 
turns; on the other hand, the rat might learn to run toward the brighter windowed 
side of the laboratory. We could evaluate these alternatives by turning the T-maze 
around so that the right arm now points west and the rat approaches the choice 
point from the north. If the rat turns tight, and therefore away from the windowed 
side of the laboratory, we would say that the rat shows response learning. If the rat 
turns left, toward the windowed side of the laboratory, we would say that the rat 
Shows place learning: it moved toward the Same place, even though it did so by 
turning in a different direction. The question is therefore whether the rat has 
learned right versus left turns or east versus west turns. 


j The rat’s performance depends in large part on the stimuli available both 
inside and outside the maze. Th 


and other gross features 
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hich stimuli are presented. If an experimenter is 
um (i.e., some dimension along which stimuli 
Can vary, such as intensity or position of a light), it may make a difference whether 
many or few stimuli are presented, or whether each stimulus is presented for a short 
OF a long time, or whether the stimuli include those correlated with both reinforce- 
ment and nonreinforcement or only those correlated with reinforcement. Different 
arrangements of stimuli are the basis for extensive research on gradients of stimulus 


control. 

Generalization gradients. If 
single stimulus and some property 0 
On the difference between the origi 


must decide on the order in Ww 
interested in some stimulus continu 


a response is reinforced in the presence of a 
f that stimulus is varied, responding will depend 
nal and the current stimulus. For example, if a 


Pigeon’s key pecks are reinforced when the key is lit yellow, the pigeon will 
continue to peck, though ordinarily at lower and lower rates, as the key light is 
changed to orange and to red and to violet. This demonstrates generalization: The 
effects of reinforcement in the presence of yellow spread to other colors. 

Figure 7-3 presents data on generalization of pigeons’ key pecks to tones of 
different frequency after pecks were reinforced only in the presence of a tone of 
1000 cycles per second (H. M. Jenkins and Harrison, 1960). In one procedure (no 
discrimination training), the tone Was always present and pecks were reinforced 
according to a variable-interval or VI schedule. In a second procedure (presence- 
absence training), the tone was sometimes present and sometimes absent, and pecks 
were reinforced according to the VI schedule only in its presence. After training, 
reinforcement was discontinued and tones of other frequencies were presented for 
the first time, along with no tone and the original training tone. These stimuli were 
Presented eight times each in mixed order, and the pecking in the presence of each 


i ding. 
stimulus demonstrated the control that each exerted over respon REL 
As shown for the three pigeons in Figure 7-3 (top), without discrimination 


training neither the frequency of the tone nor its presence or absence had ary 
Substantial effect on pecking. Responding Was evenly distributed a 
the gradient of generalization was relatively flat or, in other le e ৰ ৰ 
reinforcement in the presence of the original tone spread uniformly to a 0 the 
stimuli tested. But for the five pigeons given discrimination training, in Figure 7-3 
(bottom), the original frequency controlled higher rates of pels any ag 
frequency; in general, the greater the difference between the Er 0 
the tested frequency, the lower the rate of pecking. It is 2 ne re দ 
rates of pecking occurred in the absence of the fo nn ulus 
condition correlated with extinction. But response rate ন I fe RY 
even though discriminated responding had been establishe ৰ only wi ন 0 ‘ 
Presence or absence of the tone and not with respect to EES ন) 0 
Bradients in Figure 7-3 can be called generalization Er rea: 8 } ট 
followed discrimination training between stimuli on the at a i 0 j e 
gradient was obtained. (Differences in the form of genera TRE oR also 
Produced by other variables, such as the reinforcement sche ্ ; ন TE ot 
ing: e.g., Hearst, Koresko, and Poppen, 1964;D. R. Thee 1698). Ski, ;or 
the level of deprivation: €.8- W. O. Jenkins, Pascal, and Walker, fh 
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e to say that an organism is attend- 
data, and we should therefore not 
explanation for the data that 
timulus dimension A is 
likely to attend 
n of why the 


He do. We decide whether it is appropriat 
hens stimulus dimension on the basis of the 
isla. “this term, derived from the data, as an 
Gye in the first place. (Similarly, if we say that sl 
ld ient than stimulus dimension B because organisms are more 

an to B, we cannot then use this salience as an explanatio 


Organi 
ganism attended to A rather than to B.) 
Post-discrimination gradients. Stimulus-control gradients can also be obtained 
long the dimension. 


after establishing a discrimination between two or more stimuli a 
‘Bure 7-4 compares such a post-discrimination gradient with a generalization 
EL (Hanson, 1959). For one group of pigeons (generalization), key pecks in 
Sa HT of a single wavelength on the key were reinforced according to a VI 
ত ule; the rate of pecking in the presence Of a variety of different wavelengths 
nati then determined during nonreinforcement. For a second group (post-discrimi- 
the On), key pecks were reinforced according to a VI schedule in the presence of 
the same wavelength as the first group, but this wavelength alternated with another 
in the presence of which pecks were not reinforced (extinction); as in the first 
Broup, the rate of pecking in the presence of a variety of wavelengths was then 
determined during nonreinforcement. The gradient after training with a single 
Stimulus had its peak at that stim king decreased with increases in 


tt ulus; rate of pec 
0 difference between the training stimulus and the test stimulus. After discrimina- 
10. 

n between two stimuli, however, 


p the peak of the gradient was displaced (a peak 
Shi 
hift) from the reinforcement stimu 


j Jus in a direction away from the extinction 
Stimulus. Relative to the gradient obtained from the first group, stimuli closer to 
the reinforcement than to the extinction stimulus produced higher response rates, 
and stimuli closer to the extinction than to the reinforcement stimulus produced 
lower response rates. (Similar effects are also obtained when the discrimination is 
based not on reinforcement in the presence of one stimulus and extinction in the 
Presence of another, but rather on 8 higher frequency of reinforcement in the 
Presence of one than another stimulus: Guttman, 


1959). 
jy One account of the form of the stimulus-control gradient after discrimina- 
tion training is provided by Spence (1937). Spence assumed that reinforcement in 
the presence of one stimulus created a gradient of increments in responding 
Centered on the reinforcement stimulus (excitatory gradient), that extinction in 
the presence of the other stimulus produced a gradi 


ent of reductions in responding 
ড 
es on the extinction stimu toy BadiLnT, BnG TAT RR 
erent stimuli after discriminati 


lus (inhibi 
inhibi on training could be derived by subtracting the 
I itory from the excitatory gre heoretical difference gradient 
wed a peak shift: The gradient maxim 


dient. Spence’s t 
EN um was displaced from the reinforcement 
Tot a direction away from the extinction stimulus. But subtraction could 
ae NE difference gradient anywhere higher than the original excitatory 
{Reto al his account was consistent with the form but not the absolute value of 
for SE gradient. Nevertheless, Spence’s account was the foundation 
quent quantitative analyses of stimulus-control gradients (cf. Hearst 


B 
esley, and Farthing, 1970; Rilling, 1977). 
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Pigeon eats, the pigeon’s discrimination might be sharper in the yellow than in 


other regions of the spectrum even before any experimental conditions. Because we 
cannot know what color discriminations might be established in an organism's 
natural environment, an alternative approach is to create an environment in which 
Color discriminations are not possible. This can be done by raising an organism in 
a monochromatic environment, i.e., an environment lit by only a very narrow band 
of Wavelengths (e.g., the yellow light emitted by a sodium vapor lamp). To humans 
in such an environment, objects have no color; everything appears in shades of gray. 
A suitable organism for an experiment in monochromatic rearing is the duck- 
ling, because the duckling is capable of walking, pecking, eating, and drinking soon 
after it hatches, and reinforcement procedures therefore can be initiated early. 
Flat generalization gradients across wavelengths were obtained from ducklings that 
had been reared monochromatically, suggesting that color experience is a condition 
for peaked generalization gradients for wavelength (N. Peterson, 1962). This 
finding, however, has not been repeated reliably by other investigators (e.g., Tracy, 
1970). In addition, discrimination between only two wavelengths after 
Monochromatic rearing produced fairly standard post-discrimination gradients, 
including a peak shift, even though ducklings under such conditions were mostly 
responding to wavelengths they had never seen before (Terrace, 1975). The 
questions therefore remain to some extent unanswered. But once attention to 
Stimuli along some continuum has been established by differential reinforcement 
With respect to two stimuli along the continuum, we should not be surprised by 
differences in responding in the presence of other stimuli along the same continuum 
(ef. Guttman, 1956). Such differences are properties of the stimulus classes or 
discriminated operants established by differential reinforcement, and such classes 
Are undoubtedly determined by a variety Of factors, including the organism's 
Sensory apparatus and its past history of stimulus control. E 

Inhibitory gradients. Once Spence’s account had ‘Been invoked to handle the 
form of post-discrimination gradients, interest arose in finding a direct way to 
Measure the inhibitory gradient directly. The concept of inhibition was controver- 
Sial, but the difficulties were methodological as well as theoretical. To determine 
Whether a stimulus reduced responding, there had to be some responding in the 
Presence of the stimulus in the first place (cf. Honig, 1961). But once 
reinforcement had generated responding, whether in the presence of a single 


Stimulus or in the presence of many stimuli along the continuum, the effects of 
ly with reinforcement schedule, number of 


extinction would vary considerab 

reinforcement sessions, and other factors. A procedure was needed that would 

make the dimensions along which ar extinction gradient was determined 
elated with reinforcement (H. M. Jenkins and 


independent of the dimensions corr : 
Harrison, 1962; H. M. Jenkins, 1965; Hearst, Besley, and Farthing, 1970). Such a 
Procedure is illustrated in Figure 7-5 (Honig, Boneau, Burstein, and Pennypacker, 


1963). These investigators worked with one group of pigeons for which the 
reinforcement stimulus was a vertical line on the key and the extinction stimulus 
Was a lit key without a line, and a second group for which these stimuli were 
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Figure 7-5. Excitatory and inhibitory stimulus-control gradients after variable- 


mulus (SD) and extinction in the 
igeons are shown for two different 
udy, a vertical line was correlated 
with extinction for one group, and 
up. Responding in the presence of 
f a line was determined during non- 
in, and Pennypacker, 1963, Figure 1.) 


reversed. For both groups, variable-interval (VD) schedules were used during 
training, and stimulus-control gradients along the dimension of line Orientation 
were Obtained during nonreinforcement. 

Figure 7-5 shows data for two such groups from each of two studies. When 
the vertical line was correlated with reinforcement and the absence of a line with 
extinction, the rate of pecking was lower with greater deviations from vertical 
(triangles); for these pigeons, the effect is analogous to that of Figure 7-3 (bottom). 
Just as peaked gradients with respect to frequency of a tone were Obtained after 
discrimination training with respect to presence or absence of a tone, peaked 
gradients with respect to line orientation were Obtained after discrimination 
training with respect to presence or absence of a line. But when the absence of the 
line was correlated with reinforcement and the vertical line was correlated with 
extinction, the rate of pecking was higher with greater deviations from vertical 
152 
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(circles). In a post-discrimination gradient, the effects of a stimulus are necessarily 
determined by the location of both the reinforcement stimulus and the extinction 
stimulus (e.g., if stimulus B on a continuum is between reinforcement stimulus A 
and extinction stimulus C, it cannot be moved toward one without also being 
moved away from the other). In the present instance, changing the orientation of 
the line affected the distance of the stimulus from the vertical extinction stimulus 
while presumably having no effect on its distance from the reinforcement stimulus, 
Which was the absence of a line. te | 
Questions can always be raised about the dimensions of the stimuli to which 
the organism is attending. For example, Suppose it was argued that the pigeon was 
looking only at the very upper edge of the key. If the line is shifted from vertical, 
its top moves to the right or left, and at its upper edge the key more closely 
resembles a key without a line. It may also be important that organisms are more 
likely to attend to properties of stimuli correlated with reinforcement (feature- 
positive discrimination) than to those correlated with extinction (H. M. Jenkins and 
Sainsbury, 1970). Such stimulus factors could influence the form ofa Ho Our 
judgment must be based on the pigeon’s behavior, and if we wished to eva ee 
these possibilities, we would have to do so experimentally (e.g., by A e 
lower or middle portions of the line in Lg "ONG and seeing whether 
th iseon’s behavior: cf. Touchette, bie 
ey ae er gradients seem to imply that EE 
Some kind of active suppression of responding rather than ney the e 
Outcome of discontinuing reinforcement. This view follows from t e SE 
between the concentration of the effects of reinforcement 3 0 ie 
reinforcement stimulus and the concentration of the effects El on 
region of the extinction stimulus. Yet it is difficult to say whet t + ছাপি on 
Procedure itself, the stimulus, or some other event should be 3 a je লা 
inhibitor: accounts in terms of inhibition are not Bh he ত 
is said to be inhibited but no inhibiting event can be speci ie NR Ee 
inhibition might more appropriately be reserved for cases aE 
inhibited and inhibiting events can be specified (as when NE 
ment of one response reduce some other response, €.g., Ca j 


Stubbs, 1974). 


Fading: Stimulus control by 

successive approximations 

roperties that define an operant class can be gradually 

hn Tot 5 the stimulus properties that Ne a op 

Operant class can be gradually changed by analogous lures J 8 

Instead of exposing an organism directly to difficu t-to-discrimin! imuli, 
minate stimuli and then gradually move to 


training can begin with easy-to-discri imuli gradu: J 
the role aiffcult stimuli. For example, for a pigeon § key pecking reinforced with 
food, it is usually harder to establish a discrimination between vertical and 
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horizontal lines than between red and green. Once a discrimination between red 
and green is established, however, the discrimination between vertical and horizon- 
tal can be successively approximated by superimposing vertical on red and 
horizontal on green and then gradually fading out the colors (Terrace, 1963b). 

Establishing stimulus control through fading is often effective (e.g., Hively, 
1962; Sidman and Rosenberger, 1967; Sidman and Stoddard, 1967), but as with 
shaping no simple rules exist to determine how rapidly stimuli should be faded in 
Or out in particular situations. For example, after we had superimposed vertical on 
red and horizontal on green and had partially faded out the colors, we might find 
that we could remove them completely, leaving responding under the stimulus 
control of vertical and horizontal. On the other hand, we might instead find that 
even after the colors were very dim they remained the stimuli to which the 
Organism was attending, so that discriminated responding disappeared each time we 
dimmed the colors below certain threshold levels. The former would represent 
successful fading, but the latter would represent failure to transfer the discrimina- 
tion from color to line orientation. 

Just as shaping requires some behavior available to be shaped, fading requires 
discriminative responding available to be shifted to a new stimulus dimension. 
Consider, for example, a phenomenon called errorless discrimination learning 
(Terrace, 19631). Shortly after pigeons’ pecks on a red key had been shaped with 
food reinforcement, reinforcement in the presence of red was continued according 
to a variable-interval schedule. Three-minute periods of red alternated with another 
stimulus, during which pecks could not be reinforced. At first, this other stimulus 
consisted of a dark key that lasted for five seconds. Gradually, 
three sessions, the duration of this other stimulus was increased 
from dark to dim green and then through various intensities of 
to the human eye matched the brightness of the red key. B 
conditions, three minutes of red Correlated with reinforceme 
with three minutes of freen correlated with extinction. With ea 
this procedure was used, stimulus control was established 


responses to the extinction key in the entire course of traini 
almost without exception on the red ke 


over the course of 
and it was changed 
green to a level that 
y the end of these 
nt were alternating 
Ch pigeon for which 
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nd intensity, hundreds 
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errorless is not obvious. We must be cautious about 
the language of errors in this type of situation; the term error implies a judgment 
about the value of responding that does not seem appropriate to a behavioral 
analysis. Nevertheless, the fading procedure is not solely of theoretical interest; 
because of its potential for applications to education and other areas, it may have 


much practical significance. 


in which such a performance is 


The vocabularies of 
differentiation and 
discrimination: A summary 


Both differentiation and discrimination involve differential reinforcement. 
The major difference between these two processes is whether differential reinforce- 
ment is imposed on properties of responding or on properties of the stimuli in the 
Presence of which responding occurs. The main implication of this difference is 
Procedural: in studies of differentiation, the experimenter must wait for the 
Organism’s responses, whereas in studies of discrimination the experimenter can 
control the order and duration of stimulus presentations. These and other variables 


determine the form of stimulus-control gradients, which include generalization 
ts, and inhibitory gradients. Ambiguous cases 


Bradient discrimination gradien e 
of Rehr I Teriniiion can be identified (e.g., should we ৰ that in 
temporally spaced responding the organism discriminates 5 a 2 a সি 
that the temporal properties of the organism S responses have ET A fl ¥ 
but the procedural difference seems to justify the maintenance of this distincti 

in vocabulary. The vocabulary is summarized in the following: 
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DISCRIMINABLE PROPERTIES 
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d in our everyday vocabulary, as when 


Discriminated operants are behavior classe. 
nswering a telephone. The red light 


responding. Such classes are often identifie 
we speak of stopping at a red traffic light 0 
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can vary in brightness and size and the telephone ring can vary in loudness and 
timbre, but responding is reasonably independent of variations along such 
dimensions and so we speak in terms of these classes of events rather than in terms 
Of particular instances. 

Relatively simple dimensions of stimuli, such as intensity or location, can be 
the basis for differential reinforcement. The experimental question is whether 
responding conforms to the differential consequences, in that more responding 
occurs in the presence of a stimulus correlated with reinforcement than in the 
presence of a stimulus correlated with nonreinforcement. Differential reinforce- 
ment may also be arranged for complex Properties of stimuli that are not easily 
quantified. For example, children learning to read must be able to name the 
letters of the alphabet. But the Properties important for distinguishing among some 
letters are different from those important in distinguishing among others (e.g., 
Straight line versus curve is important in distinguishing U and V, but not V and A 
or V and N), and different distinctions are important for lowercase than for upper- 
case letters (e.g., no pair of Uppercase letters has the up-down or left-right reversals 
that must be mastered to read b, p, d, and q). The way in which a child learns to 
distinguish among letters of the alphabet depends on relations among such stimulus 
Properties as symmetry, Curvature, and closure. Those properties essential to 
discriminating among different letters are called critical features (e.g., Gibson, 1965). 
It is not sufficient, however, to enumerate critical features. For some letters, the 
Upper-case and lower-case forms differ more from each other than they differ from 
Other letters (e.g., e, £, and F, or h, n, and WN). Given the multitude of forms, what 
then defines the class of stimuli that occasions our Saying a or e or r? This question 
is about stimulus structure in learning to read the letters of the alphabet; it is 


important to remember that questions of stimulus structure are different from 
questions of stimulus function. 


that a chair had been placed to the right of a 
are concrete objects, being-to-the-right- 
this relation from being-to-the-left-of. 
fore, relations among stimuli have been 


Of is not, and yet we can clearly discriminate 
In some discrimination experiments, there- 
the dimensions of interest. 
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Figure 7-6. Diagram of one trial in a matching-to 
Pigeon chamber. After an intertrial interval of t se 


G) appears on the center key. A peck on the ce! f 
Observing response, turns on comparison stimuli on the two side-keys. One compari- 


son stimulus matches the sample; the other (red: R) does not. If the pigeon pecks 
the matching comparison stimulus, food is delivered and the intertrial interval of a 
new trial begins; if the pigeon pecks the nonmatching comparison stimulus, the 
next intertrial interval follows without food. The sample stimulus and the location 


of the matching comparison stimulus vary from trial to trial. 


ample stimulus." This response 


igeon looks at the s 
ne matches the sample stimulus 


this peck guarantees that the p 


turns on comparison stimuli on the two side-keys. 0 \ 
and the other does not. A peck on the matching side-key then produces a reinforcer 


followed by a new intertrial interval, whereas a peck on the nonmatching side-key 
is followed directly by the intertrial interval without a reinforcer (sometimes a 
nonmatching peck also extends the duration of the intertrial interval, on the 
assumption that such a consequence would punish nonmatching pecks). 

The sample stimulus ordinarily changes from trial to trial; for example, it 
might be green on some trials and red on others. The position of the matching side- 
key also changes from trial to trial. A frequent component of matching-to-sample 

sponse, but that term is also applied to 


d an observing respo. i 
mes calle i as when reinforcement and 


iscrimil ulus, 
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is the correction procedure, which arranges that when a trial ends with a peck on a 
nonmatching side-key the same sample and comparison stimuli are repeated on the 
next trial. This procedure prevents the development of consistent responding to 
only one side-key or only one color. For example, suppose a Pigeon always pecked 
the comparison stimulus on the left side-key. If the position of the matching side- 
key nonetheless alternated irregularly between left and right, these pecks would be 
reinforced on half the trials, i.e., those with the matching stimulus on the left. 
Reinforcement on half the trials could be enough to maintain left-key pecking 
indefinitely. On the other hand, with a correction procedure the pigeon would have 
to switch to the right key sooner or later, because once a matching stimulus 
appeared on the right that combination of sample and comparison stimuli would 
not change until a right-key peck had occurred and been reinforced. Similarly, if 
the pigeon always pecked red side-keys, these pecks would be reinforced on half the 
trials unless a correction procedure forced the Pigeon eventually to switch some- 
times to pecking green side-keys. 

Suppose now a pigeon was responding accurately in a matching-to-sample 
Procedure using red (R) and green (G) stimuli. How should this performance be 
characterized? Has the pigeon learned to peck red side-keys in the presence of red 
center-keys and green side-keys in the presence of green center-keys? Or has the 
Pigeon learned to peck the left key in the presence of the configurations RRG and 
GGR and the right key in the presence of the configurations GRR and RGG? Or 
might we wish to say simply that the pigeon has learned the concept of matching? 
We might present blue or yellow as a new sample stimulus. If the pigeon matched 
these colors we would feel more confident about Speaking of a concept of matching 
(in fact, matching in Pigeons does not transfer easily to new colors, although the 
likelihood of such transfer can be modified by the training procedures). Even if we 
Saw matching with new colors, what would we say if we failed to get matching with 
geometric stimuli such as squares and circles as the sample and comparison stimuli? 
Would we say the pigeon had learned the concept of color matching but not form 
matching? If we did so, we at least would have to note that the human concept of 
matching seems not so limited by the specific dimensions of the matched stimuli. 

There are many variations on matching-to-sample. If reinforcement is 
arranged for pecks on the nonmatching rather than the matching side-key, the 
procedure becomes an instance of oddity responding, because this key is necessarily 
the odd key of the three keys (in other versions of oddity procedures, no sample is 
used; with pigeons in a three-key chamber, for example, each trial consists of 
lighting all three keys, with the odd key any one of them). The relations among 
stimuli can also be arbitrary, when the procedure is sometimes called symbolic 
matching. For example, we could train the Pigeon to peck a green key given a 
square as a sample and a red key given a circle as a sample. We might then ask 
whether the pigeon might peck a square given a green sample and a circle given 
a red sample. (We often expect such reversibility when we deal with words and 
objects, as when a child who has learned to point to a Picture of a car on seeing the 
word car can also point to the word on seeing the picture of a car. This reversibility, 
however, is not to be taken for granted; cf. chapter 10.) Each example involves a 
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complex of stimulus relations; such complexes are sometimes called concepts (e.g., 


Kelleher, 19583). 


Concept formation 


s generalization within a class of stimuli and 


| Concepts have been spoken of a 
discrimination between classes of stimuli (F. S. Keller and Schoenfeld, 1950). Thus, 


our concept of red must involve generalization among all stimuli we call red, and 
discrimination between these stimuli and others we do not call red. Concepts are 
therefore to classes of stimuli what operants are to classes of responses. (Some 
theorists speak of responding on the basis of some single property of stimuli as 
abstraction, and restrict concepts to responding on the basis of some combination 


of properties. We will not be concerned with such a distinction, in part because of 
its ambiguity. For example, being-to-the-left-of might be regarded either as a single 
relational property or as a combination of properties necessarily including both a 
reference point and a stimulus to the left of that reference point.) Whether we are 


dealing with size, color, shape, particular objects, or any other dimensions of the 
environment, we must always consider these as classes of stimuli, and therefore 
concept formation seems coextensive with discrimination learning. The literature 
On such learning is substantial (€.£., Blough, 1975; Heinemann and Avin, 1973; 


Mackintosh, 1977; Wildemann and Holland, 1972). 


Natural concepts 

t to define discriminative stimuli by 
rties that define the letter A vary 
wer case, and whether it appears 


that it is often difficul 


We already noted 
ple, the prope 


physical dimensions. For exam 


according to whether the letter is upper case or lo’ ther it appent 
as type or as script. The capacity to discriminate among such stimuli exists in 


animals as well as humans (e.8., discriminations between various forms of the letter 
A and the digit 2 have been established in pigeons: Morgan, Fitch, Holman, and 
Lea, 1976). But the difficulties of defining stimuli by measurable physical proper- 
ties are not limited to arbitrary classes established by humans, such as letters and 
numbers. They exist also with everyday objects and events. What distinguishes dogs 
from other animals? On what basis do we generalize between 2 chihuahua and a 
husky by calling both dogs, while we discriminate between huskies and wolves or 
foxes? (In quantitative treatments, such classes are called fuzzy sets: Rosch, 1973.) 

Procedures have been established in which pigeons learn to discriminate 
between pictures that contain a human form and those that do not (e.g., Herrn- 
stein and Loveland, 1964). Such discriminations have been called natural 
concepts. In one study (Herrnstein, Loveland, and Cable, 1976), Slides were 
presented on a screen next to 8 pigeons key and key pecks were reinforced in the 
presence of some slides but not others. Some pigeons learned discriminations 
between pictures with and without trees; others learned discriminations between 
pictures with and without water; Still others learned discriminations between 
pictures with and without particular people. After training with one set of slides 


160 Discriminated Operants: Stimulus Control 


(e.g., slides with and without trees), the pigeons discriminated among slides that 
0s 8 

had not been presented before. New slides were sometimes more accurately 

discriminated than slides used in training. The implications were drawn that: 


...Wwe cannot begin to draw up a list of co 
tree, the pigeons did not require that it b 
branching, and so on (overlooking the problem of common elements nested 
within terms like leafy, vertical, woody, and so on). Moreover, to be TECOgniz- 
able as a nontree, a picture did not have to omit greenness, woodiness, 
branchiness, verticality, and so on. Neither could we identify common 
elements in the other two experiments. If not common elements, what? 


No other theory is so easily characterized, though in crude terms an alterna- 
tive suggests itself. Pigeons respond to clusters of fe: 


mmon elements. To recognize a 
€ green, leafy, vertical, woody, 


Cc conjunctions and 
disjunctions, the discovery of which would require far more effort than seems 
Justified by any possible benefit. (Herrnstein, Loveland, and Cable, 1976, 
PP. 298-299) 
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Was measured when a sweet solution was presented and when drinking was accom- 
panied by noise and light. The results are summarized in Figure 7-7. Rats that had 
been Xi-jrradiated drank less sweetened water but their drinking was not affected 
by noise and light. Rats that had been shocked drank less when drinking was 
accompanied by noise and light but their drinking was not affected by whether 
the water was sweetened. In other words, when the aversive stimulus was the 
delayed systemic consequence of X-irradiation the rats learned only its relation to 
the taste of the water consumed earlier, but when the aversive stimulus was electric 
shock the rats learned only its relation to the external noise and light that preceded 
it. The delayed effects of X-irradiation punished the drinking of sweetened water, 
and the immediate effects of shock punished drinking accompanied by noise and 
light. Rats are therefore predisposed to learn different relations among 
discriminative stimuli and contingencies in different situations. It is not enough to 
say that organisms learn some responses or some stimuli more easily than others; 
an adequate account must deal with relations among responses and stimuli. 

Revusky and Garcia (1970) discuss the implications of these findings in the 
context of a thought experiment in which the reader finds one hundred dollars: 


n insane billionaire experimenter because, two 
for dessert instead of your usual 
crease the future probability that 
je. It is very unlikely that this experiment will be 


hours ago at lunch, you ate g0' 
apple pie. The experimen 
you would eat gooseberry P 


successful. . . . hundreds 0 L 
between consumption of the gooseberry pie and the receipt of the $100. The 


odds are very great that you would have associate 


Garcia and Koelling (1966) 
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Punisher 
mal fluid intake by rats whose drinking of sweetened 
Water was accompanied by clicks and lights. In one group, drinking was followed by 
“irradiation: in a second group, drinking was followed by electric shock. The 
discriminative stimulus that became effective in reducing drinking depended on the 
nature of the punisher. (From Garcia and Koelling, 1966, as presented by Revusky 
and Garcia, 1970, Figure 9.) 
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Learning set 


The organism can come to us prepared to learn different discriminations in 
different contexts, but we must also recognize that the learning of a new 
discrimination can be modified by the organism's history of discrimination learning. 
Relations among stimulus properties may come to control responding independently 
of specific stimuli. A phenomenon called learning set illustrates this (Harlow, 1949). 
Two different objects are presented to a food-deprived monkey in successive trials. 
Their positions vary from trial to trial. Food is under only one of the objects, and 
therefore the response of picking it up is differentially reinforced. After a number 
of trials, the monkey masters the discrimination. Then a new pair of objects is 
presented, with differential reinforcement again arranged for picking up only one 
of them. Again a discrimination is established. Another new pair is then presented, 
and later still another, and so on. 

Data averaged across eight monkeys are shown in Figure 7-8. Each set of 


points shows the percent of correct responses over the first six trials across blocks 


of successive discriminations or problems. (Note that the y-axis begins at 50%, 
Jems (1-8), correct responses 


Which is chance level.) Over the first eight prob 
increased gradually; by the sixth trial, they had not even reached 80%. In the next 


block of eight problems (9-16), correct responses increased more rapidly over trials. 
Over successive blocks, correct responding rose more and more rapidly over trials, 
until by the last block of trials (289-344) responding nearly reached 100% accuracy 
On the second trial of each new problem. In other words, the more problems the 

monkey had mastered, the more rapidly it mastered each new problem. দ 
In this procedure, the discriminated operant cannot be described simply in 
terms of the components of a given stimulus pair. When successive discriminations 
became established so rapidly that monkeys consistently choose the stimulus 
correlated with reinforcement after a single trial with a new pair, we must 
characterize performance by relations between stimuli and their correlated 
Consequences over successive problems rather than within single problems. When 
the early problems are presented, the monkey must learn not only the particular 
stimulus correlated with reinforcement within each problem, but also many other 
aspects of the procedure (e.g., that food is correlated with one of the two stimuli 
rather than with position or Some other feature of the setting; that this correlation 
does not change within problems; that reaching simultaneously for both stimuli 
will not be reinforced; and so on). During the early blocks of problems we must 
assume that learning takes place slowly because the monkey is learning many 
things; only later, when these other aspects of this situation are learned, does the 
monkey have to learn only which of the two stimuli is correlated with food in a 
given problem. At this point, we may define the discriminated operant as follows: 
If the response to a given stimulus is reinforced on the first trial with a new 
Problem, responding to that stimulus will continue on all subsequent trials; but if 
that response on the first trial is not reinforced, then responding will switch to the 
other stimulus for all subsequent trials. In learning set, this is the performance that 
is reinforced and this is the performance that the organism’s behavior approaches. 
Thus, the correspondence between the reinforcement contingencies and the 
163 
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behavior generated by these contin. 


gencies remains an appropriate criterion for this 
behavioral class. 
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three levers each within a monkey’s chamber. A particular sequence of presses was 
reinforced within a given session (e.g., left lever of group 1, right lever of group 2 
middle lever of group 3, right lever of group 4), but the sequence changed frOm 
session to session. After the monkey’s rate of mastering a new problem each day 
becomes stable, the repeated acquisitions permit the study of a diversity of learning 


variables (e.g., the effects of different fading procedures, or of different types of 
ons in the sequence, or of establishing 


consequences for responses at various positi 
sed to building it up either from the 


the response sequence as a whole as oppo. 
beginning or from the end). 


SUMMARY 


We began this chapter by exploring parallels between differentiation and induc- 
tion on the one hand and discrimination and generalization on the other. Both are 
outcomes of differential reinforcement; they differ mainly in whether this opera- 
tion is arranged for stimulus properties or response properties. Functional aspects 
of stimulus control were treated in the context of experiments on attention, on 
stimulus-control gradients, and on fading procedures. In dealing with discriminable 
properties of the environment, we considered how complex relational features of 
the environment could serve as discriminative stimuli. Examples included matching- 
to-sample, natural concepts, preparedness, and learning set. All of these cases 
emphasized stimuli, but we found it important to treat these stimuli in terms of 
their relations to responses and consequences. Stimulus control is based upon the 
three-term contingency: stimulus-response-consequence. None of these terms is 
Significant in isolation. We may speak of differential reinforcement with respect to 
stimulus properties, or of the correlation of stimuli with contingencies, or of 
Operant classes defined by the stimuli that set the occasion for responses in a class. 
These variations on the lan: imination learning simply stress different 


guage of discri 
aspects of the same three-term contingency. 


Schedules 
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Schedule is derived from the Middle English sedule, a slip of parchment or 
paper, which is in turn derived from the Latin scheda, papyrus leaf, and the 
Greek skhizein, to split. The Indo-European root, skei-, to cut or split links 
schedule to schizo-, as in schizophrenia, and to science and conscious ‘from 
the Latin scire, to know, in the sense of being able to separate one thing from 


another. 


istent consequences. The reinforce- 


asses of responses have cons 
es called intermittent or partial 


ment of some responses but not others, sometim 
reinforcement, is a general feature of behavior. For example, winning is not an 
invariable consequence of placing a bet, and finding a particular commodity is not 
an invariable consequence of going to a store, and getting an answer is not an 
invariable consequence of asking a question. Consider calling a friend on the 
telephone. Sometimes this response is reinforced by the opportunity to talk with 
one’s friend. At other times, the friend does not answer or the friend’s line is busy. 
Continuous or regular reinforcement, the reinforcement of every response within 


an operant class, is the exception rather than the rule. For this reason, we must 
examine the effects of schedules of reinforcement, arrangements that specify which 
responses within an opera 


nt class will be reinforced. We already have considered 

some schedules (ratio schedules in chapter 1, differentialreinforcement-oflow-rate 
schedules in chapter 6, and interval schedules in chapter 7). Schedules can arrange 
reinforcement on the basis of response number, the time at which responses occur, 
d rate requirements can also be 


or the rate of responding. Number, time, an 
combined in diverse ways to produce more complex schedules. We will be 


concerned with the effects of reinforcement schedules not only as valuable 
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experimental tools, but also as ubiquitous and perhaps fundamental properties of 
behavior in their own tight (cf. Ferster and Skinner, 1957; Schoenfeld and Cole, 
1972). 

Let us return to the telephone example. Suppose we call a friend and get no 
answer. The likelihood of getting an answer on later calls ordinarily depends on 
When we call and not on how many times we call. The friend will be available and 
thus the call will be answered only if it occurs at a particular time. Similarly, 
Suppose we call and get a busy signal. Our calling again will not affect how long the 
busy signal lasts. Some variable time will elapse that depends only on how long the 
friend talks with someone else. Once the friend hangs up, our next call will produce 


call to get an answer, but 
a particular time. These 
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differently than if the reinforcer is some other consequence, such as a telephone 
sale made by someone in business or the recruitment of a volunteer for a fund 
drive. Getting an answer to a telephone call is usually reinforced according to a VI 
schedule, but because making a sale or recruiting a volunteer usually depends on the 
number of calls made, these events typically reinforce calling according to a VR 
schedule. We will not ordinarily have problems when we deal with simple responses 
such as pigeons’ key pecks or rats’ lever presses and simple reinforcers such as food 
Or water deliveries. We will concentrate on such cases in the following survey. It is 
important to remember, however, that accurately applying the language of 
reinforcement schedules to settings outside the laboratory demands that we 
carefully specify the contingencies and the responses and reinforcers that enter into 


these contingencies. 


VARIABLE-INTERVAL AND 
VARIABLE-RATIO SCHEDULES 


In a variable-interval or VI schedule, the delivery of a reinforcer depends on the 
Passage of a variable time and then the emission of a single response. In a variable- 
ratio or VR schedule, the delivery of a reinforcer depends on the emission ofa 
variable number of responses without regard to the passage of time. Some 
properties of VI and VR schedules are illustrated by hypothetical cumulative 
records in Figure 8-1. Three records that might be generated by VI 1-min reinforce- 
ment are shown on the left (4, B, and C); three records that might be generated by 
VR 100 reinforcement are shown on the right (D, E, and F). 1 
Consider first how we might arrange 8 VI schedule. We take a tape or film 
Strip that can be driven at a constant speed by a motor and punch holes in it at 
irregular intervals. Using a switch that senses the location of these punched holes, 
we then allow a response to be reinforced each time one of these holes ত 
the switch. A portion of such a tape is shown just above the time scale in Figure 
8-1.1 An alternative to the tape as 4 method for VI scheduling is to use a clock that 
provides a sequence of electrical pulses and randomly to select some A of 
these pulses to arrange or set up a reinforcer for the next response. Each met od 


has advantages and disadvantages: With a tape, the experimenter must be a 
with the effects of repeating the sequence over and over; with random selection 
from a pulse source, the experimenter does not know ahead of time how long the 
next interval will be (cf. Fleshler and Hoffman, 1962; Farmer, 1963; Millenson, 
1963; and Reynolds, 1968, Appendix ID). | 
LY B and C of Figure 8-1 illustrate one of the most important 
il i i i 8-1 is whether the 
ie i f VI scheduling not incorporated into Figure 
tape Go ERS inning once a reinforcer has been scheduled for the next response. 


i til the reinforcer has been delivered 
ents are used, stopping the tape unty t1 b 
ET OE EL SES dure, because it guarantees. that intervals are timed from some 
environmental event, namely the reinforcer delivery, with which the organism has come in 
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Figure 8-1. Hypothetical Segments of cumulative records 
tained by a one-minute variable-interval schedule (VI l-min) and a one-hundred- 
response variable-ratio schedule (VR 100). The i i 


umber of responses per reinfo. 
response rate. 


Properties of a VI schedule: The rate at which reinforcers are delivered is not 
affected by the rate of responding. 

records are considerably different 
reinforcers in each case. This sched 
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ing stimulus-control gradients: chapter 7). 


ariable-ratio or VR schedule. Again we use a 
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CRE EES 

d i e switch senses a hole in the tape the 
next response is reinforced. On the average, one response is reinforced per hundred 
responses but the number varies from one reinforcer to the next. In this case 
number of responses per reinforcer is independent of response rate, and rate of 
reinforcement depends on response rate. In Figure 8-1, response rate is highest in 
2D, and thus the reinforcers are delivered within a shorter period of time than in E'; 
similarly, equal periods of time are represented in E and F, but more responses নি 
reinforced in E than F because response rate is higher in E£ than in F. 

As with VI schedules, alternative methods of scheduling are available. For 
example, a device that randomly selects some fraction of the emitted responses 
for reinforcement might be substituted for the tape. With either method of schedul- 
ing, the VR schedule like the VI schedule generates roughly constant rates of 
responding between reinforcers, at least with moderate ratio sizes. When the ratio 
Of responses to reinforcers becomes very large, VR responding may be interrupted 
frequently by pauses. 

Rates of responding maintained by VI and VR schedules are illustrated in 
Figures 8-2 and 8-3. Both figures show data obtained with pigeons’ key pecks 
reinforced with food. In Figure 8-2, rate of pecking is plotted as a function of rate 
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tion after VI and after VR reinforcement. A low rate of VI reinforcement produces 
less responding than a high rate of VI reinforcement, but in both cases responding 
is fairly uniformly distributed in time; in addition, the decrease in response rate 
during extinction after VI reinforcement is gradual. With VR reinforcement, on the 
other hand, the decrease in response rate with larger ratios comes about in part 
because responding begins to be interrupted by long pauses (the appearance of long 
pauses during ratio performance is sometimes referred to as ratio strain); in 
addition, extinction after VR reinforcement usually produces abrupt transitions 
from high response rates to periods of no responding. 

What properties of VI and VR schedules are responsible for their characteris- 
tically different performances? It seems reasonable that VR schedules should 
generate higher rates of responding than VI schedules, because an increase in VR 
response rate makes the delivery of reinforcers more frequent whereas an increase 
in VI response rate cannot have this effect. In what way can these different 
relations between response rate and reinforcement frequency act on an organism’s 
behavior? Whether the schedule is VR or VI, the separation between successive 
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VI schedule in which the successive interreinforcement intervals match those 
by the first pigeon’s VR performance. In these circumstances, the two 
50! edules differ in responses per reinforcer but not in time between successiv! 

reinforcers. Conversely, the role of responses per reinforcer can be studied By 
ITRUBNE one pigeon’s VI schedule so that the responses emitted per reinforcer by 
this pigeon determine the ratios of a second pigeon’s VR schedule. In this case ie 
second pigeon’s pecks are maintained by a VR schedule in which the SOdRUVE 
ratios match those produced by the first pigeon’s VI performance. In these circum- 
stances, the two schedules differ in the times between successive reinforcers but 


not in the responses per reinforcer. 
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schedules, the distribution of reinforced IRTs has the same form as the distribution 
of all IRTs. This is not so with VI schedules; because the probability of reinforce- 
ment increases with IRT, a relatively larger proportion of long IRTs is reinforced 
and therefore the distribution of reinforced IRTs is shifted toward longer IRTs 
relative to the distribution of all IRTs (cf. Revusky, 1962). These relations among 
IRT distributions are inevitable features of VR and VI scheduling. Organisms must 
therefore in some sense be sensitive to them when their responses are affected by 
the respective schedules. 

Limited hold. One temporal contingency sometimes added to schedules is 
called the limited hold or LH. With a limited hold, a scheduled reinforcer remains 
available or is held for a response only for a limited time; if no response occurs 
within that time, the reinforcer is lost. The telephone again is an illustration. If one 
gets a busy signal when calling into an overloaded switchboard, getting a ring and 
an answer may not become more likely as time passes, because the lines never 
remain open very long. If the lines at such a switchboard are used again within only 
five seconds after someone hangs up, we describe the schedule as a VI schedule with 
a five-second limited hold. A caller is most likely to get through such a switchboard 
by hanging up immediately after a busy signal and dialing again within five seconds. 
In the laboratory, adding a limited hold to a schedule typically increases response 
rate (e.g., Hearst, 1958); in general, the shorter the limited hold the greater this 
increase, except that a very short limited hold may allow so few responses to be 
reinforced that the schedule fails to maintain responding. The effect of the 
limited hold on VI performance can be interpreted in terms of its attenuation of 
the differential reinforcement of long IRTs: the probability of reinforcement does 
not increase as IRT durations approach the duration of the limited hold, because 
the limited hold may cancel the most recently available reinforcer before the next 


response occurs. 


Reinforcement schedules and 
causality 


of reinforcers depend on the responses they follow, but 
reinforcers can follow responses both when produced by 1eeponses and when 
delivered independently of responses. Is responding affected in the same way when 
it produces a reinforcer as when it happens by accident to be followed bya 
reinforcer? We gave this question a preliminary answer when we considered the 
phenomenon called superstition. The role of the temporal contiguity between 
responses and reinforcers has been discussed by Herrnstein (1966): 


The effects 


i j i i in time among responses, stimuli 
ble that the accidental correlations in t ly li, 
fc EE do not exert control over behavior?... One of the characteris- 
tics of accidental correlations between behavior and environmental events is 
variability. Every aspect of behavior may vary and yet be contiguous with a 

dent of the behavior. In contrast, behavior that is 


inforcer that is indepen ; 
it must have at least one aspect that has a more or less fixed 
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Correlation with the reinforcer. Were animals sensitive to this difference, they 
could detect those events over which their behavior has no real control. 
(Herrnstein, 1966, pp. 42-43) 


Consider now an experiment by Lattal (1974). Pigeons’ key pecks were 
reinforced according to a VI schedule. Once VI performance was well established, 


the schedule was changed: at the end of Some intervals, the reinforcer was delivered 
immediately, without regard to responding. For exam 
reinforcers were response-produced 


-produced, response 
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When reinforcers are delivered independently of responses, the time between 
the most recent response and the reinforcer is likely to vary from one reinforcer to 
the next. Thus, Lattal’s data suggest that this variability does counteract 
superstitious responding. But what about cases in which the time between the most 
recent response and the reinforcer varies even though the reinforcer is response- 
produced? This occurs, for example, when delay of reinforcement is added to a 
schedule. A case is illustrated in the event records in Figure 8-7 (left). Vertical lines 
represent responses and arrows represent reinforcers. The top record shows a 
segment from a standard VI schedule: the interval ends at the dashed line, and the 
next response, a, is followed immediately by the reinforcer. A segment from a 
VI schedule to which a three-second delay of reinforcement has been added is 
shown in the middle record: the interval ends at the dashed line, and the response b 
produces a reinforcer three seconds later; because other responses occur during 
this time, the time between the last response and the reinforcer (c) is shorter than 
the delay interval (d), and this time varies depending on the spacing of responses 
during the delay. The bottom record shows a segment from a VT or response- 
independent schedule: when the interval ends, at the dashed line, the reinforcer 
is delivered, and because it occurs independently of responses, the time between 

inforcer (e) varies. 

We ৭ পপ্লদ ls < he data from three Pigeons Obtained with 
these schedules (Sizemore and Lattal, 1977; see also B. A. Williams, uy The 
rates of pecking maintained by VI with delay were between the erate rates 
maintained by the VI schedule and the low rates maintained by the response- 
independent reinforcers of the VT schedule. We just concluded that the difference 
between VI and VT must depend in some way on the variable times between the 
last response and the reinforcer (¢ and e). Why then are ol rates with delayed VI 
reinforcement higher than those with VT reinforcement? The Rs remain open 
to experimental analysis, but it is reasonable to assume when i Bt ll 
decreases with delayed VI reinforcement, it eventually reac esa চা at whic 
Pauses longer than the delay occur often enough that +H ৰণ 6 pe 
intervene between the reinforcer and the response that produced it. In these 
instances the times between the last response and a reinforcer can NW 
than the scheduled delay, and therefore they will no longer be hes e. With VT 
reinforcement, on the other hand, there is no upper limit on these times, and 

e variable. 
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that separate responses and reinforcers. These distributions distinguish events 


ৰে i ter 5 (cf. Figure 5-5). Other procedures 

2 gl yas discussed in chapte ন ট ৰ 
ft Delay of zelnorm on lade the presentation of a distinctive stimulus VE the 
kl studying HEfnforceniolit a consequence of responding that may acquire reinforcing 
lay, which is an immediat elation to the later reinforcer delivery, and the requirement 
Properties of its own through its 1 dure is equivalent to the differential 


TO CEA 
that no responses occur during the ee a Ferster, 1953; Dews, 1960). 
Teinforcement of a response followed by a pause CC - 


180 Reinforcement Schedules 
caused by behavior from events accidentally correlated with behavior; they are 
therefore central to the concept of causality. For this reason, the study of 
reinforcement schedules is fundamental, and the demonstration that humans are 
similarly sensitive to the consequences of their own behavior has both theoretical 
and practical significance (cf. Holland, 1958; Weiner, 1969; B. A. Matthews, 
Shimoff, Catania, and Sagvolden, 1977; Rachlin and Frankel, 1969). 


FIXED-INTERVAL AND 
FIXED-RATIO SCHEDULES 


er deliveries i 


S constant are called fixed-time or FT 
€S are of intere. 


St because they introduce discriminable 
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ing reaches half its final or terminal rate 40 seconds into a 100-second fixed 
interval, it is likely to do so 20 seconds rather than 40 seconds into a 50-second 
fixed interval. (This consistency in relative rather than absolute time appears to be 
a property of elicited as well as consequential behavior; cf. Killeen, 1975.) 

Any theoretical treatment of FI performance must consider the finding that 
the FI scallop survives repeated interruptions. The phenomenon is illustrated in 
Figure 8-8 (Dews, 1962). The key pecks of four pigeons were reinforced with food 
according to FI 500-second schedules. The left graph shows the average rate of 
pecking in successive fifty-second portions of the interval. In another procedure, 
the houselight, which provided general illumination in the chamber, was turned on 
and off in successive fifty-second portions of the interval; after the last fifty 
seconds of the interval, it remained on until the peck at the end of the interval was 
reinforced. Pecking was not reinforced in the absence of the light, and pecking 
during these times decreased in rate, as shown by the dark bars in the right graph 
of Figure 8-8. Nevertheless, the increase in responding in successive periods when 
the houselight was on, as shown by the shaded bars, was similar in form to the 
increase when there were no interruptions of FI responding during the interval. 
Because of findings such as these, the FI performance has received varied 
theoretical treatments (e.g., Gollub, 1964; Zeiler, 1977; B. A. Schneider, 1969; 
Dews, 1970b; Hawkes and Shimp, 1975). Among the issues are whether the FI 
Scallop represents a gradient of delayed reinforcement (in that responses at various 
locations in the interval are consistently followed by the reinforcer at the end of 
the interval) or a gradient of temporal discrimination in that the organism’s 
different rates at different times in the interval imply discrimination of the elapsed 
time ination of such gradients. 

, ER to Frodsalio or FR schedules. In these schedules, the 
last of some fixed number of responses is reinforced. For example, it usually 
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pigeon’s pecks on one key were reinforced according to an FR 100 schedule while a 
VR schedule operated concurrently on the other key. The VI schedule operating on 
the other key retarded the development of the typical FR break-and-run 
performance, and in effect made it possible to examine this development in slow 
motion; the change in the temporal patterning of responses that usually takes place 
in two or three sessions took place over 150 sessions. In the early sessions, respond- 
ing was fairly uniformly spaced between reinforcers. Lower response rates and 
eventually pauses began to follow reinforcers only with continued exposure. The 
portion of the ratio consisting of uninterrupted high-rate responding gradually 
became longer, so that in later sessions responding continued with few if any 
interruptions after each postreinforcement pause. With successive sessions, the FR 
run appeared to build itself up backwards from the reinforcer. 


The treatment of schedule performances as response units in their own right 


led to the development of higher-order schedules of reinforcement. For example, 
n of a certain number of fixed 


if reinforcement depended on the completio 
intervals, we could speak of this arrangement as a second-order schedule in which 


successive FIs were reinforced according to an FR schedule (e.g., Findley, 1962; 
Kelleher and Gollub, 1962; Bigelow, 1971). An analysis of the component 
performances in such schedules would be concerned with the structure of behavior, 
and would be analogous to examining the properties that define an operant class 


(cf. chapter 6 and Shimp, 1976). 


THE VOCABULARY OF 
REINFORCEMENT SCHEDULES 


inforcement schedules. The vocabulary for these 
but also includes admittedly idiosyncratic 
Jar rationale for distinguishing between FI 
two different names were needed 


We have considered a variety of re 
schedules provides major classifications, 
usages. For example, there is no particu 


and VI versus FT and VT schedules except that 1 : 
(we could as easily justify such names as fixed-duration and variable-duration, 


presumably abbreviated FD and VD). Nevertheless, it is important to note the 
distinctions. For example, Fl, DRL and FT schedules all require the passage of a 


constant time, but they differ in response requirements. An FI schedule imposes 
no restriction on responding during the interval, but one respone must occur at 
the end of the interval; some time must elapse, during which few or many responses 
can occur, and at the end of that time the next response is reinforced. A DRL 
schedule, however, requires a specified time without responses, and then the next 

t the time over. Finally, 


response is reinforced; responses that occur too soon Star r. Final 
in an FT schedule, the reinforcer is delivered at the end of the specified time 


Whether or not responses have occurred. These schedule names were decided some- 
What arbitrarily in the evolution of research on reinforcement schedules, but 
are well established (cf. Catania, 1968). 

Although there have been several noteworthy attempts to classify reinforce- 
ment schedules more systematically (e.8., Schoenfeld, Cumming, and Hearst, 1956; 


TABLE 8-1 


Summary of Basic Reinforcement Schedules (T sec = time in seconds; N = number 


of responses) 


Name and Abbreviation 


Comment 


Variable interval 


Fixed interval 


Variable ratio 


Fixed ratio 


Variable time 


Fixed time 


Continuous 
reinforcement 


Extinction 
Limited hold 


Differential 
reinforcement of 
low rate (or long 
IRT) 


Differential 
reinforcement of 
high rate 
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Contingency 
VI T sec, then 1 response 
FI T sec, then 1 response 
VR N responses 
FR N responses 
VT T sec 
FT T sec 
(FRI) 71 response 
EXT ~- 
LH Reinforcer canceled 
if no reinforced 
Tesponse within T 
sec 
DRL T sec without 
response, then 1 
response 
DRH 


1 response within T 
Sec Or less of last 
response 


T varies; with properly 
distributed intervals, 
generates roughly constant 
response rate. 


T is constant; generates FI 
scallops 


N varies; high constant 
rates, but large N produces 


breaks in responding called 
ratio strain. 


Nis constant; generates 
postreinforcement pauses 
and high-rate runs. 


T varies; provides response- 
independent reinforcers. 


Tis constant; provides 
response-independent 
reinforcers. 


All responses reinforced; 
formerly abbreviated as 
CRF. 


As description of procedure, 
often used without regard 
to whether response had 
Prior history of reinforce- 
ment. 


T is constant if not other- 
wise specified; LH cannot 
Stand alone, but must 


always be a feature added to 
Some schedule. 


Maintains responding easily 
because decreases in 
Tesponse rate increase rein- 
forcement and thus prevent 
extinction. 


Alternatively, at least N 
Tesponses within T sec; 
Tesponding may be difficult 
to maintain, because 
decreases in Tesponse rate 
lead to decreases in rein- 
forcement. 


TABLE 8-1 (continued) 


Name and Abbreviation Contingency Comment 
Differential DRP 1 response between Sets both upper and lower 
reinforcement of T and T'sec of last limits on reinforceable 
paced responding response response rates. 
Differential DRO T sec without A negative-punishment or 
reinforcement of response omission procedure; 


ordinarily decreases the 
likelihood of the designated 
response. 


other behavior 


Skinner, 1958; Mechner, 1959; Dews, 1960; Snapper, Knapp, and Kushner, 1970; 
Schoenfeld and Cole, 1972), none has attained general usage. Table 8-1 summarizes 
among schedules. The schedule definitions apply 
Whether reinforcers are arranged successively and without interruption or occur 
Within separate trials (e.g., an Fl is ordinarily timed from the last reinforcer, but if 
Other events were arranged between successive intervals the timing could begin with 


the onset of some stimulus, such as a color presented on a pigeon key). a 
The first two columns of Table 8-1 provide schedule names and their standard 


abbreviations. In practice, the abbreviations are usually accompanied by designa- 
tions of time or number (e.g., VI 30-sec, FT 60-sec, LH 5-sec, DRL 10-sec, and 
FR 50). The next column describes the schedule contingency, or the conditions 
under which the schedule reinforces responses of delivers reinforcers (cf. especially 
the relations between FI and FT and between DRL and DRO). The last column 
Provides brief commentary on each schedule. The vocabulary of Table 8-1 is in 
terms of reinforcement schedules, but can also be extended to punishment 
schedules (e.g., Azrin, 1956; Hendry and Hendry, 1963). The symmetry of 
reinforcement and punishment effects, illustrated in Figure 5-1, applies to schedul- 
ing effects as well (e.8., superimposing an FI schedule of punishment on maintained 
Tesponding produces an inverted scallop, a gradually decreasing response rate as the 


end of the interval approaches). 


the major procedural distinctions 


SCHEDULE COMBINATIONS 
chedules can be created without limit 


ys (e.g., Herrnstein and Morse, 1958; 
1s, 1968; Barrett, 1975, 1976). For example, 


reinforcers can be arranged for completing either of two s ST requirements 
(alternative schedules) or both of two schedule requirements (conjunctive 


5 j .sec FR 50 schedule, either the first response 
hedules). In att terns? El onse will be reinforced, whichever comes 


after thirt ds or the fiftieth resp i i 

fist; in a CUTE FI 100-sec FR 20 schedule, a response will not be reinforced 

until both 700 ds have passed and nineteen responses have already been 
seconds t, the arrangement is called an 


j i interac 
emitted. When time and number requirements In’ টা 
interlockin g schedule. For example, an interlocking FR FI schedule might decrease 


lation. New s 


Schedules need not operate in is : 
various Wa 


by combining the basic schedules in 
Berryman and Nevin, 1962; Powe 
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the duration of the interval as a function of number of responses, or EE 
increase the size of the ratio as a function of the passage of time (an instance of an 
interlocking schedule is winding a Wristwatch, in which the reinforcer is the tension 
of the fully wound spring; to the point at which the watchspring is completely run 
down, the number of turns required to wind the Watch increases with the passage 
of time). 


The effects of schedule combinations are Sometimes paradoxical. For 


hat an FI 15.minute schedule of 
tained more than 300 pecks per 
t was then added by changing to a 
quirement of FR 40, well below the 
d per interval anyway, the rate of 
» Up to a conjunctive FR 240 FI 15-minute 


1 On Various problems of historical 
£ (multiple schedules), conditioned 
Oice (concurrent schedules). We will 
areas in a sampling of specific research 


outlined in Table 8-2. Some schedules bea: 
Significance, such as discrimination learnin, 
reinforcement (chained schedules), and chi 


Provide only a brief overview Of these three 
issues. 


Multiple schedules: 
Behavioral contrast 


stimulus, and the performance Appropriate to e 
corresponding stimulus. For example, if an FI 


L t Tcement versus extinction but also with 
two different reinforcement schedules). 


he components of a multiple schedule are not 


necessarily independent of each other. For example, if VI reinforcement maintains 


TABLE 8-2 


Compound Schedules. For convenience, each case is defined in terms of only two 


arbitrary component schedules, 
number of components. In some insta 


A and B, but compound schedules can include any 
nces, Stimuli are designated by S with a 


superscript that indicates the schedule each stimulus accompanies. 


Schedule 


Definition 


Example (with Abbreviation) 


Alternative 


Conjunctive 


Interlocking 


Multiple 


Mixed 


Chained 


Tandem 


Concurrent 


Conjoint 


Second-order 


Reinforcer is contingent on 
either A or B. 


Reinforcer is contingent on 
both A and B. 


Reinforcer is contingent on 
some combined function of 
A and B. 


A during SA alternates with 
B during SB. 


A alternates with B. 


During 54, completion of A 
produces SB; during S°, 
completion of B produces 
reinforcer. 


A produces 


Completion of 
f B produces 


B; completion o 
reinforcer. 


one response 


A operates for 
for another 


and B operates 
response. 


at the same 


A and B operate 
dently for 


time but indepen 
a single response. 
nof Ais reinforced 
B. (Reinforcing 
der according to 
6 creates a 
dule, and SO 

r orders.) 


Completio 
according to 
the second-or 
a third schedu 
third-order sche. 
on for other highe 


Completion of either a VR or 
a Vl is reinforced, whichever 
occurs first (altern VR VI) 


Reinforcer depends on the 
completion of both an FR 
and an FI (conjunc FR FI) 


Reinforcer is delivered when 
the sum of responses plus 
elapsed seconds equals some 
constant (interl FR FT) 


VI during red alternates with 
EXT during green (mult VI 
EXT) 


DRL alternates with FI, with 
no correlated stimuli (mix 
DRL FD) 


Completion of VR during 
blue produces yellow; 
completion of FR during 
yellow produces food (chain 
VR FD) 


Completion of VR produces 
DRH, and completion of 
DRH produces food, with no 
correlated stimuli (tand VR 


DRH) 


One VI operates for pecks on 
left key and another VI for 
pecks on right key (conc VI 


VD 


VI and DRP operate simul- 
taneously for presses on a 
single lever (conjt VI DRP) 


Successive FRs are treated as 
response units reinforced 
according to an FI schedule 


(FI [FR]) 
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a pigeon’s key pecking during one stimulus and the schedule in the presence of a 
second stimulus changes from VI reinforcement to EXT (extinction), decreased key 
pecking during the second stimulus is often accompanied by increased key pecking 
during the first. The phenomenon, called behavioral contrast, is illustrated in 
Figure 8-10 (G. S. Reynolds, 19610). A multiple VI 3-minute VI 3-minute schedule 
Operated for a pigeon’s key pecks in red and green. The VI reinforcement in red was 
maintained while the schedule in green was changed from VI to EXT and then back 
to VI. The response rate in red increased during EXT in green, even though there 
Was no change in the red VI schedule; both rates returned toward their earlier 
Values when the green VI schedule was reinstated. 

Behavioral contrast has received extensive analysis, ranging from accounts 
in terms of quantitative properties of schedules (e.g., G. S. Reynolds, 1961a; 
Lander and Irwin, 1968; Herrnstein, 1970; Rachlin, 1973) to accounts that appeal 
to frustrative or emotional effects of reduced reinforcement (e.g., Amsel, 1958; 
Terrace, 1968). Varieties of contrast effects have been distinguished, ranging from 
relatively permanent or sustained increases to those that last for only seconds or 
minutes after a schedule change (e.g., Catania and Gill, 1964; Nevin and Shettle- 
worth, 1966; Staddon, 1969). Contrast effects also Vary with responses, reinforcers, 
and organisms (e.g., Hemmes, 1973; Beninger and Kendall, 1975). 


Empirical questions have been raised by behavioral contrast (e.g., whether it 
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Figure 8-10. Effects of extinction in one component of a multiple schedule on 
responding maintained by reinforcement in another component. The multiple- 
schedule components consisted of alternate three-minute periods of red and green 
projected on the pigeon key. In the sessions Shown, a VI 3-min schedule of food 
reinforcement for the pigeon’s pecks operated during red; the schedule during green 
was changed from VI 3-min to extinction and back to VI 3-min. The increased 
response rate in the unchanged component during the decreased reinforcement in 
the other component is called behavioral contrast. (After G. S. Reynolds, 19616, 
as adapted by Terrace, 1966, Figure 10.) 
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2 (B). (From Catania, 196914, 


Figure 8-11. Two interpretatio 


response 1 produced by extinc 
response 1 produced by reinforcement of response 


Figure 4.) 


is related to peak shift: see chapter 7), but the phenomenon also involves 
conceptual issues. Assume response 1 is a key peck in red and response 2 is a key 
peck in green. Both A and B in Figure 8-11 might be discussed in terms of 
behavioral contrast: response 1 (Solid line) increases from its baseline level (X) to 
a higher level (Y) during extinction of response 2 (dashed line), and returns to its 
baseline level when reinforcement of response 2 is reinstated. But how are we to 
tell which is effect and which is baseline? We could assume equally well either that 


extinction of response 2 enhanced response 1, as shown by the arrows in A, or that 
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reinforcement of response 2 inhibited tesponse 1, reducing it from its baseline level 
When reinforced by itself, as shown by the arrows in B. 

The data argue that behavioral Contrast is consistent with 4 of Figure 8-11, 
and can best be interpreted as the Summation of two kinds of pecks: those 
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rds of the performances of three pigeons on chained 
ted of six consecutive FI 15-sec schedules in the 
The six components, numbered in the top 
ments; food deliveries were followed by a 
(Catania, Yohalem, and Silverman, unpub- 


Figure 8-12. Cumulative reco 
and tandem schedules. Both consis 
last of which a peck produced food. 
record, are separated by pen displace: 
reset of the pen to its baseline position. 
lished data.) 

more than the 90-second minimum demanded by the six successive 15-second 


intervals (the schedule differs from FI 90-sec in that at least six pecks are required, 
one at the end of each of the six 15-sec intervals). The bottom record shows the 
same chained and tandem schedules operating in alternation for Pigeon 392’s 
pecks. Again long pauses occurred in the early components of the chain whereas 


the tandem sequence was completed expeditiously. p f 
The chained-schedule stimuli reduced responding relative to the tandem 
this phenomenon with the assumption that the 


schedule. How can we reconcile 3 ৰ 
successive stimuli of the chain should become conditioned reinforcers through their 
relation to the food at the end of the sequence? The presence of different stimuli 


in the chained schedule made a performance otherwise executed in a little more 
than 90 seconds sometimes take 15 minutes or more (with a corresponding decrease 


in the frequency of reinforcement). The low rates in the early components of 
ombined discriminative effects of the chain 


chained schedules represent the c d 
stimuli, in that responding is never reinforced with food in the presence of these 
stimuli, and the reinforcing effects of the contingency between responding and the 

e data show that only the last components of the 


onset of the next stimulus. Th t j t 
substantial reinforcing effects. 


chain, close to the reinforcer, acquire 
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Concurrent schedules: 


Matching, maximizing, and 
choice 


20 reinforcers per hour by pecking only the first 
key, and 60 reinforcers per hour j 
keys, however, it could produc 
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son organisms tend to distribute their responses 
dules. But there is a complication. If a peck on 
by a reinforced peck on the other key, the 
d of the sequence of pecks. Thus, pecks on each 


key may come partially under the control of the reinforcers scheduled for the other 
key. To maintain the independence of the schedules, therefore, concurrent VI 
Procedures often incorporate a changeover delay, which prevents either response 
from being reinforced immediately after a changeover from the other response 
(Herrnstein, 1961a; Catania and Cutts, 1963). With a changeover delay, the 
distribution of an organism’s responses to two concurrent schedules tends to be 
proportional to the distribution of reinforcers arranged by the schedules (e.8., 
Herrnstein, 1961a; Herrnstein and Loveland, 1975). In the preceding case, the 
Pigeon emits about twice as many pecks on the VI 30-sec key as on the VI 60-sec 


key. 


reinforcers per hour. For this rea 
to each of two concurrent VI sche 
one key is immediately followed 
reinforcer may affect the likelihoo 


This phenomenon is sufficiently general that it has been proposed as a general 
law of behavior, called the matching law (Herrnstein, 1970; see also Catania, 1976; 
Rachlin, 1971). The law states that the relative frequency of a response will match 
the relative frequency of the reinforcers produced by that response. The law even 
holds for concurrent ratio schedules, because exclusive responding on one schedule 
means that all the reinforcers will be delivered according to that schedule. Herrn- 
stein’s account has also been applied to the responding maintained by a single 
tel laroanink schedule. Us Figure 8-2), on the assumption that other events 
besides the reinforcers arranged by the experimenter may have reinforcing effects, 


even though we cannot identify them. Y ; 
tively summarizes performances in a variety of 


The matching law effec : J 
schedules, but its empirical foundations have been controversial. For example, 
consider how concurrent VI schedules operate when arranged for a pigeon’s pecks 

to peck one key, more and more time passes 


On two keys. As the pigeon continues 

during which the VI schedule for the other key may have made a reinforcer 
available. Thus, a time will be reached when the probability of reinforcement for 
changing over 10 the other key will be greater than the probability of reinforcement 


for continuing to peck the same key. It has therefore been argued that the pigeon 
distributes its responding to both keys in concurrent VI schedules because the 
Pigeon always emits that response with the highest current probability of reinforce- 
ment, and this response shifts from one key to the other as time passes (Shimp, 
1966). This performance has been described in terms of the principle of maximizing, 
which states that an organism with several responses available will emit the one with 
the maximum probability of reinforcement. In the Case of unequal concurrent VR 
or FR schedules, this response is always the one with the smaller ratio, but in the 
case of concurrent VI schedules the response with the maximum probability of 
reinforcement changes from moment to moment; concurrent VI performance has 


therefore been called momentary maximizing. * 
Matching and maximizing seem to be contradictory alternatives. But they 
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keys, as shown by the bar at 2x marked A and B. The rate of response A is now 
lower, concurrent with B, than when response A was reinforced alone. 

The relation between the two rates is similar to that in behavioral contrast, 
but whereas behavioral contrast in multiple schedules is best regarded as an incre- 
ment produced during extinction (Figure 8-11, top), the interactions among 
concurrent schedules are best regarded as decrements produced during the 
reinforcement of another response (Figure 8-11, bottom). Experiments that 
Observed the effects on pigeons’ pecking on one key while responding and 
reinforcement were independently varied on a second key have demonstrated that 
the interactions of response rate observed within concurrent VI schedules depend 
directly on the reinforcers produced by each response, and do not depend on com- 
petition between responses for the available time (Catania, 1963b, 19692). These 
findings show that, although there are superficial similarities between multiple 
and concurrent schedules, it is risky to generalize from one to the other (cf. Killeen, 


1972). 


Initial Links 


Terminal Terminal 


Schedule B 


Schedule A 


Reinforcer Reinforcer 
f a concurrent-chain procedure as it might be 


Fi, i atic diagram 0. r 
LTE SE initial links, both keys are lit white (W), and 


arranged for a pigeon’s key pecks. In in } 
equal but in dependend VI schedules operate for both keys. According to its VI 


schedule, pecks on the left key produce terminal link A, in which the left key is lit 
green (G) and the right key is dark; in terminal link A, pecks on the green key 
produce reinforcers according to schedule A. Similarly, according to its VI 
schedule, pecks on the right key produce terminal link B, in which the right key is 
lit red (R) and the left key is dark;in terminal link B, pecks on the red key produce 
reinforcers according to schedule B. The relative rates of pecking the two keys 
during the initial links are preferences for the respective terminal links. For 

white key more often than the left white key 


example, if a pigeon pecked the right K e 
in the above example, it would be appropriate to say that the pigeon preferred 


schedule B to schedule A. 
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Variables that have small effects in Single-response schedules sometimes have 


large effects in Concurrent schedules. Concurrent schedules are therefore often used 
to study reinforcement Variables (e.g., 


delay of reinforcement: Chung, 1 
Concurrent schedules can make diffe 
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ent schedules is likely to come not from such 
its derivatives, but rather from more general 
se rate, such as the resistance of 


the diverse phenomena of reinforcem 
specific measures as response rate and 
properties of behavior independent of respon. 
behavior to change (Nevin, 1974). 
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Conditioning 


NINE 


includes a number of etymological clusters. For 
d anxiety share the same roots, as do choleric, 
and wrath and worry, and sad and satisfy (the last 
). Fear, from the Indo-European per-, to try, to 
has an extensive group of relatives that includes 
probability and opportunity, approach and 
ry synonyms for behavior, comportment 


The language of emotion 
example, eager, anger an 
melancholy, glad and glee, 
pair is also related to satiate 
Tisk, to press forward or lead, 
experiment and apparatus, 
deprivation, and two contempora 
and performance. 


ounter a topic that once dominated the psychol- 
ded the opening chapters of most learning texts. 


In addition to respondent conditioning, it has gone by such names as classical 
conditioning and Pavlovian conditioning, and the language of conditioned reflexes 
has to some extent entered the everyday vocabulary (although in popular usage it 
has often been confused with instances of learning in operant behavior). The term 
conditioned, from the Russian ycnosunf, might better have been translated as 
conditional, because the name was applied to reflexes conditional upon relations 
among environmental stimuli. Respondent conditioning is an instance of stimulus 
control applied to stimulus-presentation operations rather than to the contingencies 
in consequential operations. In other words, instead of signalling the consequences 
Of responding, a stimulus may simply signal the presentation of some other stimulus. 
Pavlov’s conditioned salivary reflexes are the prototype example: when a bell or 
a light repeatedly signalled food in the mouth of a hungry dog, salivation began to 
be elicited by the signalling stimulus as well as by the food itself. 

In discussing consequential operations, we had to speak of classes of 
responses rather than individual instances, because individual responses are never 
exactly repeated. We called these classes operants. Similar problems exist for 
elicited behavior. For example, successive elicitations of salivation by food may 


In respondent conditioning, we enc 
ogy of learning so much that it provi 
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CONDITIONAL REFLEXES 


either case can we attribute the salivation 
to food as an eliciting Stimulus: in the 
food was Presented, and in the second 
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extent that the bell is now a stimulus that has acquired the power to elicit saliva- 
tion, we say that we have created a new respondent class, salivation elicited by the 
sound of the bell. We call the relation between bell and salivation a conditional 
reflex because it is conditional on the prior relation of bell and food. 

The sequence of events is illustrated in Figure 9-1. The bell at first elicits 
Orienting responses, but these responses disappear with repeated presentations; at 
this point, the bell may be called a neutral stimulus. In an unconditional reflex, 
food elicits salivation; in this relation, the food is called an unconditional stimulus 
or US and the salivation is called an unconditional response or UR. Conditioning 
begins when the bell reliably precedes food; at this point, the bell still has no 
effect on salivation, and we may continue to regard it as a neutral stimulus. After a 
period of conditioning, a conditional reflex may be established; the bell may elicit 
salivation before food is presented (a), or the bell may elicit salivation even when 
food is omitted on an occasional trial (b). The bell may now be called a conditional 
stimulus or CS and salivation elicited by the bell a conditional response or CR. 

The difference between a conditional stimulus and an unconditional stimulus 
is not simply which comes first. If we reversed their order, we would not expect the 
eliciting effects of food followed by bell to be much different from food alone. In 
fact, the relative effectiveness of stimuli as conditional and unconditional stimuli 
may be predictable from the probabilities with which the stimuli elicit their respec- 
tive responses. For example, consider salivation elicited by food in a dog’s mouth 
and leg flexion produced by shock to the dog’s leg. A mild shock may become a 
conditional stimulus eliciting salivation if it is reliably followed by food, but a 
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Neutral Stimulus NA (Orienting Response) 
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US» UR 


Unconditional Reflex 
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Figure 9-1. Relations between stimuli and responses in respondent conditioning. 
An initially neutral stimulus (BELL: NS) is followed by an unconditional stimulus 
on. If the neutral stimulus begins to elicit respond- 


(FOOD: US) that elicits salivati t 
ing like that elicited by the unconditional stimulus, the neutral stimulus is called a 


conditional stimulus (CS). OR - orienting response; UR - unconditional response; 
CR - conditional response. 
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tioning in differential conditioning, a bell is OS ET NE 3s not. 
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Figure 9-2. Schematic presentati 
using a bell as an example of a c 


d its name from the assumption that the 
conditional stimulus had to leave some trace in the organism's nervous system for 
such conditioning to be effective. But successive presentations of the unconditional 
stimulus itself at regular intervals (e.g., every half-hour) also produce conditional 


delivered. Trace conditioning acquire 
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1 association of ideas, and পা 
tional reflexes seemed to represent a Primitive example of the formation he 
Associations, Jf iqeas Were ASsociated, jt was ATgued, then the occurrence 0 
Would lead to the other. 


T 
as having common elements % 
er in time. It tTemained then to Suggest that ideas could be a 
ted as responses Benerated by environmental events So that if events occur 
together in the Past reme: 


for another. 
dasa Proc 
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 teoretica] qeb iven 
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of interpreting instrumental OF Operant behavior as an instance 0 
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to finding Ways 
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& 43 imuli can 
constant, however, the relation between conditional and unconditional at o 
Vary. Assume that a bell (S1) and food (S2) are arranged within eo HO 
can ignore the stimuli that demarcate the trials. The rows in Figu 
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ons between two stimuli, $1 and $2. Each row repre- 
from a conditioning procedure. Each case involves 


exactly three pairings of S1 and 52, in trials 2, 5 and 7, but only in the first case is 
S1 positively correlated with 52. In the second case S2 is as likely given S1 as not 
given SL, and in the third case the likelihood of S2 is less given 51 than not given S1. 
Only in the first case will S1 be established as a conditional stimulus relative to 52 
as an unconditional stimulus. The probabilities of §2 given S1 and of S2 given no 


S1 are shown at the right. (Cf. Rescorla, 1967.) 


Figure 9.3. Conditional relati 
sents a sample of ten trials 


samples of ten trials from three conditioning procedures. In each, S1 is paired 
with S2 in trials 2, 5, and 7. In the top row, only those trials include 52, which is 
therefore perfectly correlated with S1: the probability of §2 given S1is 1.0, but it 
is zero without S1. In the middle row, §2 occurs in every trial, and therefore S1 
is irrelevant to whether S2 occurs: the probability of 52 is 1.0 given S1 or given no 
51. In the bottom row, S1 occurs in six trials, but in only half of these is it 
followed by S2, whereas $2 occurs in three-quarters of the trials in which 51 is not 
Presented: the probability of 52 given $1 (0.50) is lower than the probability of 
S2 given no S1 (0.75). Only in the first procedure is 51 likely to become an effec- 
tive conditional stimulus; in the last procedure, $1 might even reduce the likelihood 
of conditional responding elicited by the trial stimulus. The conditional relation 
between the two stimuli rather than the number of pairings is the appropriate basis 


for classifying conditioning procedures’ (Rescorla, 1967). 


Contiguity and consequences 


ve been established through respondent 
lov’s salivary conditioning is probably the 
strated conditioning based on such uncon- 
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Jectric shock (Bechterev, 1933). 


Many conditional responses ha 
Procedures (e.g., see Hull, 1934). Pav! 
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(Twitmyer, 1902) and limb withdrawal elicited by € 
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In fact, if S1 had elicited salivation even 1 v 00d, 

k ‘tioning. For example, a startle response might be elicited by a 

Wwe would speak of pseudoconditioning. PES ren Though these: stimull Had rover 


variety of innocuous stimuli after a traumatic shock, eV fl Le 
occurred together with the shock; thus, pseudoconditioning control procedures are sometimes 


incorporated into conditioning procedures, to make sure that observed effects depend on the 
relation between the conditional and unconditional stimuli rather than simply on the presenta- 


tions of the unconditional stimuli. 
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leg, a flexion could not prevent shock delivery, whereas if either or both electrodes 
were attached to the floor on which the dog’s paw rested, a flexion prevented or 
terminated shock delivery because once lifted from the electrodes the paw no 
longer completed the electrical circuit. In fact, the classic defensive reflex of 
Bechterev (1933) typically had both electrodes on a surface the organism touched, 
S0 that a response prevented or terminated the shock; Bechterev was therefore prob- 
ably studying avoidance and escape behavior rather than respondent conditioning. 
Recognizing the different implications of the two methods of electrode placement 
for the consequences of responding was an important step in the evolution of the 
distinction between operant and respondent behavior (cf. Schlosberg, 1937; 
Skinner, 1935; see also Kimmel, 1976). 

Once consequences became implicated in a few cases of presumed respondent 
conditioning, it was tempting to seek them in all cases. For example, even if both 
electrodes are attached to a dog’s leg in defensive conditioning, might not the 
flexion have consequences? Suppose that a bell reliably precedes electric shock. 
How can we tell whether the shock is as aversive when it passes through a flexed 
limb? In this instance of respondent conditioning, the conditional response is 
Ordinarily slower and different in magnitude than the unconditional response. 
Furthermore, if the dog’s leg is already flexed when shock is delivered, less postural 
adjustment is required than if the dog stands on all fours and then shifts its weight 
to the remaining three legs when flexing the shocked leg (Wagner, Thomas, and 
Norton, 1967). Clearly salivation too has consequences; it affects taste and swallow- 
ing in the case of dry food and dilution of the solution in the case of acid on the 


tongue. 
The introduction of respondent 
attempts to reduce all instances of opera 


conditioning into learning theory began with 
nt learning to special cases of respondent 
conditioning, but with such arguments the situation had bean turned around. The 
case was made that all instances of respondent conditioning could be interpreted 
in terms of response consequences that were earlier unnoticed. The next step was 
to observe that autonomic responses such as salivation and constriction or dilation 
OF blood, vessels were often accompanied by somatic Tesponses (i.e., action of the 
striate muscles, which produce skeletal movement). The position could then be 
taken that autonomic responses in respondent conditioning were artifacts, in that 
they incidentally accompanied behavior established through operant or instrumen- 
tal processes (K. Smith, 1954). The status of respondent conditioning therefore 
came to depend on demonstrations of conditioning that could not be interpreted in 
terms of the consequences of responding. Cs { | 
One approach was to see whether conditional responding could be modified 
by explicitly arranged consequences. If such consequences did not modify the 
conditional reflex, then the argument that the new reflex relation depended on 
other unidentified consequences would no longer be particularly convincing. 
Sheffield (1965) therefore added consequences to the conditional salivation 
generated by the classical Pavlovian situation. Specifically, a conditional stimulus 
preceded food, but food was omitted if the dog salivated on that trial. (The proce- 
dure is an example of negative punishment sometimes referred to as omission train- 
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Figure 9-4. Schematic diagram of the phases of sensory preconditioning and 
second-order conditioning procedures, using bell and tone as conditional stimuli 
and shock-elicited leg flexions in a dog as the unconditional response. 

hock. Once the conditional reflex is established, 
Jiciting effects of the bell are tested. Leg flexion 
t the bell became a conditional stimulus 


In the figure, tone is followed by 5 
SO that tone elicits leg flexion, the e 
to the bell is then taken to mean thal 


relative to the tone during preconditioning. ন 
In second-order conditioning, the order of phases is reversed. In the example 


in Figure 9.4, a conditional reflex in which tone elicits leg flexion is established 
first, by presenting tone and then shock. Later, bell is followed by tone. In this 
instance, the question is whether the conditional stimulus established in the first 
Phase can function as a unconditional stimulus for another stimulus in the second 
Phase. The difficulty is that the tone may lose its effectiveness as a conditional 
Stimulus as it is repeatedly presented without shock in the second phase, but during 
just this time the bell must acquire its conditional properties. An alternative proce- 
dure, in which bell is followed by tone and then by shock on all trials within a 
single phase, would be ambiguous; if the bell did come to elicit leg flexions we 
would not know whether these occurred because of the relation between bell and 
tone or because of the relation between bell and shock. 

If the conditional stimulus is presented alone after sensory preconditioning, 
thereby extinguishing the conditional reflex, the preconditioning stimulus also will 
no longer elicit a conditional response. In other words, in the example of Figure 
9.4, presenting tone alone after the conditioning phase until the tone no longer 
elicits flexions will also make the bell lose its effectiveness in eliciting flexions. 
But the comparable procedure after second-order conditioning does not extinguish 
the conditional response of the second-order conditional reflex (Rizley and 
Rescorla, 1972). In the example of Figure 9-4, presenting tone alone until it no 
longer elicits flexions will not eliminate the conditional flexions to the bell estab- 
lished during the second-order phase. This outcome is paradoxical. Consider an 
imaginary human case history. A young man sees blood in painful circumstances, 


and the sight of blood therefore becomes 2 conditional stimulus that elicits those 


emotional responses we call fear. At a later time (equivalent to the second-order 


Phase), the young man sees blood in an elevator, and thereby acquires a fear of 


209 


210 Respondent Behavior: Conditioning 


elevators. The young man later goes into hospital work, and in that context 
gradually gets over his fear of the sight of blood. According to Rizley and 
Rescorla’s findings, this change will not reduce his fear of elevators, even though 
the conditional response to the blood had been the basis for that fear. This case is 
imaginary, however, and we must be Cautious in the generalizations we draw from 


tute for another. 
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Figure 9-5. Schematic illustrations of overshadowing and blocking. In overshadow- 
ing, a conditioning procedure in which neither of the two stimuli of a compound 


had a prior history of conditioning establishes only one of the two stimuli as a 
makes one of the stimuli more effective than 


the Renal stimulus (or, alternatively, 1 yg y 
e other). In blocking, conditioning is established with respect to one stimulus, 
and this history prevents the other stimulus from becoming effective as a 
conditional stimulus when the stimuli are presented as a compound (or, alterna- 
tively, reduces the effectiveness of the second stimulus). S1 and S2 - stimuli; US - 
unconditional stimulus; CR - conditional response. 
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When they do 50, they sometimes acquire the 


capacity to reduce the effectiveness of conditional stimuli, and are described as 
inhibitory. These effects can be measured most directly with stimuli presented in 
combination. An example is provided in Figure 9-6, which illustrates a condition- 
Ing procedure involving food-elicited salivation in a dog. First, a bell is established 
As a conditional stimulus (cS). Once the bell comes reliably to elicit salivation, 
2 tone is presented either by itself or together with a light on irregularly alternating 
trials. When tone (CSB) is presented alone, it is followed by food. When it is pre- 
sented with the light (CS), the food is omitted. Eventually tone alone elicits 
Conditional salivation, but the pairing of tone and light does not. It might be 
assumed that the dog simply discriminates tone alone from the tone-plus-light 
combination. But the inhibitory effect of the light is demonstrated when later pre- 
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stration of respondent conditioning of the pigeon’s key peck, in a procedure called 
autoshaping (P .L. Brown and Jenkins, 1968; Hearst and Jenkins, 1974; Schwartz 
and Gamzu, 1977), received special attention. In addition, because the key peck was 
a2 commonly chosen response for the study of consequential responding, it was 
important to determine the extent to which respondent processes might enter into 
those performances. 

The autoshaping procedure was introduced as a convenient alternative to 
shaping the key peck by successive approximations (chapter 6). Autoshaping begins 
With a pigeon that eats reliably from the feeder but has not pecked the key in a 
standard chamber. At irregular intervals the key is lit, and several seconds later the 
feeder is operated independently of the pigeon’s behavior. Thus, the lit key 
becomes a discriminative stimulus (or conditional stimulus) that signals another 
stimulus, food. The food occasions eating (and in the pigeon eating includes 
pecking). We therefore might say that food is an unconditional stimulus and that 
pecking food is an unconditional response. After a few presentations of lit key and 
then feeder, the pigeon begins to face and move toward the key when it lights. 


Within perhaps fewer than ten and rarely more than one hundred trials, the pigeon 
Will begin to peck the key whenever it is lit. After pecking is established by auto- 


shaping, the continuation of the procedure is called automaintenance. Autoshaping 
and automaintenance simply distinguish the changes in behavior leading up to the 
first peck from the maintained behavior following this first peck. | 

In autoshaping and automaintenance, food deliveries occur independently of 
behavior, It is therefore difficult to attribute autoshaped pecking to its conse- 
quences. Nevertheless, once such pecking begins it will often be followed by food. 
An argument based solely on the observation that no consequences of pecking are 
Obvious may not be persuasive. Autoshaped pecks were therefore studied in an 
Omission procedure analogous to Sheffield’s experiment with salivation: food was 
delivered after the key light only on trials in which the pigeon did not peck the key 
(0D. R. Williams and Williams, 1969). As with salivation, pecks were maintained ina 
Substantial proportion of trials even though those Pecks caused the omission of 
00d. Key pecking persisted over hundreds of trials, the behavior presumably 
Stabilizing at a level at which enough trials occurred without pecks (and therefore 


With food intai ins in the other trials. 
ESET d to a hungry pigeon, pecking becomes a 


When food is repeatedly presente h 
More and more dominant component of behavior as time passes between food 
Presentations (cf. chapter 3 on interim and terminal behavior). Autoshaped key 
Pecking in the pigeon may therefore be interpreted as a a Te 
Stimuly ‘Ht. Decks generated by repeated food presentations 

s Sonim of ths Key ght, and to be directed toward 


tend to satan f the key li 
occur mainly in the presence 0 
this Stimulus so strongly that they strike the key and are recorded as key pecks. 


ice autoshaped pecking begins, it may be maintained indefinitely by repeated 


Presentations of key light and food even though it has no obvious consequences (in 
Act, relative to keeping its head in or near the feeder, the pigeon S key pecking may 


elay its a 
ccess to food). bade i 
The production and maintenance of key pecking in autoshaping and auto- 
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maintenance has the critical features that define respondent conditioning. The 


respondent vocabulary is therefore appropriate. We may speak of the key light 
as a conditional stimulus. It ac 


key pecking, through its correlation with food. Food is therefore the unconditional 


Whereas when autoshaped 
inking the key. 
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Some empirical findings with various stimulus combinations in conditioning, 
including sensory preconditioning, second-order conditioning, overshadowing and 
blocking, and inhibitory stimuli in stimulus compounds. We concluded with an 
example of the conditioning of a skeletal response, the pigeon’s peck, in 
autoshaping and automaintenance. 


OPERANT-RESPONDENT 
INTERACTIONS 


Behavior can sometimes be analyzed as a combination of operant and respondent 
Processes. These processes can interact with each other when respondent-condition- 
ing procedures are combined with operant or consequential procedures. For 
example, a stimulus that reliably precedes or signals an electric Shock may itself 
come to elicit responses such as leg flexions, but it may also interrupt other 
behavior that has been maintained by its consequences, such as lever presses main- 
tained by food reinforcement. We sometimes describe comparable behavior in 
humans in terms of fear or anxiety; procedures like these have often been regarded 


4S relevant to emotion. 


Conditioning and emotion 


Stimuli that signal the presentation of other stimuli may be superimposed on 
baselines of ongoing operant behavior. For example, suppose that a rat’s lever 
Presses are maintained by an interval schedule of food reinforcement; from time to 

d of the tone shock is delivered. In these 


time a tone is presented, and at the en : \ | 
reduces lever pressing, especially as the time 


Circumstances, the tone typically ; ol 
approaches when shock will be delivered. This phenomenon, originally 
demonstrated by Estes and Skinner (1941), has been given various names: anxiety, 
conditioned suppression, and conditioned emotional response or CER. It is 

] Jopment of suppression and then 


illustrated in Figure 9-7, which shows deve I 
recovery from suppression after shock was discontinued (Geller, 1960). The rat’s 
‘min schedule of food reinforcement, and 


lever presses were maintained by a VI 2 
three-minute presentations of tone were followed by shock. After the tone came to 
Suppress responding, shock was discontinued, and lever pressing In tone recovered 


to earlier levels. BCT te ল্‌ 
The procedure is an instance of respondent conditioning: one stimulus, tone, 
signals another stimulus, shock. In this instance it is not obvious that tone elicits 
responses similar to those elicited by shock. The tone suppresses reinforced lever 
pressing, but lever pressing begins again soon after shock has been A If we 
looked more closely at the rat’s behavior during the tone, we would ind that many 
Other classes of responses were affected besides lever pressing (i.e., heart rate, 
respiration: cf. Rescorla and Solomon, 1967; Blackman, 1977). We are most likely 
n an event has wide-spread effects across 


to invoke the language of emotion whe 
many different response classes, and we may therefore be tempted to speak of the 
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rat’s fear or anxiety. But if we do so, we must recognize that such terms do not 

explain the rat’s behavior. It would not do to say later that the rat stopped pressing 

SE NSSNSE SSE LoS because it was afraid; the effect of the tone on the rat’s lever 
pressing led us to speak in terms of the rat’s fear in the first place. 

Our language of emotions is complex. We speak of emotions in others and in 
Ourselves on the basis of both situations and the behavior that occurs in those 
situations (e.g., Bem, 1967). For example, we might speak of the behavior 
produced by preaversive stimuli in terms of fear or anxiety, but if we observed 
aggressive behavior directed toward another Organism we would be more likely to 
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ন ponse pen was also displaced 
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S On the right show recovery from su i 
pression 
as no longer followed by shock. (Adapted from Geller 1960, 


the person is acting that way because 
Job or the breaking up of a love affair. 


Preaversive and preappetitive 
stimuli 


Although the language of emotion is important in our interactions with other 
people, it has not proved particularly useful in behavioral analyses of the effects of 
preaversive stimuli. Instead, the interactions between respondent conditioning and 
operant behavior, as when preaversive or preappetitive stimuli are superimposed 
On reinforced responding, have been analyzed more effectively in terms of various 
experimental parameters, such as baseline reinforcement schedule, baseline response 
rate, and so on. 

The finding that positively reinforced responsing may be suppressed by 
preaversive stimuli (Estes and Skinner, 1941) was later supplemented by the finding 
that negatively reinforced responding (avoidance: see chapter 5) may be enhanced 
by such stimuli (Sidman, Herrnstein, and Conrad, 1957). In other words, a rat 
Whose lever presses avoid electric shocks may increase rather than decrease its lever 
pressing during a stimulus that precedes an unavoidable or inevitable shock. This 
enhanced responding has been given a variety of names, including conditional or 
conditioned enhancement or facilitation or acceleration. Once such enhanced 
responding develops during negatively reinforced responding, it may continue with 
responding maintained by positive reinforcement. For example, rhesus monkeys’ 
lever pressing maintained by orange juice reinforcers was originally suppressed in 


the presence of a clicking noise that preceded shock, but after the monkeys 
that avoided shock and were returned to the 


acquired a history of lever pressing k ৰ 
initial procedure, lever pressing during the preaversive stimulus was enhanced rather 
than suppressed (Herrnstein and Sidman, 1958). | a 
The situations were then extended to superimposing preappetitive rather than 
behavior. For example, a key light that 


Preaversive stimuli on baseline operant ¥ fl ঠ 
Preceded response-independent food deliveries increased pigeons’ key pecking when 
king maintained by DRL reinforcement 


this stimulus was superimposed on key pec ন NOT oir 

(Herrnstein and Morse, 1957). Just as suppression during preaversive stimuli might 
be spoken of in terms of anxiety, it was tempting to speak of such enhancing 
effects of preappetitive stimuli in terms of joy. It ‘was then ESSUmeC UNH the 
Suppression of positively rein nhancement of oY reinforced 
responding by preaversive stimuli might be paralleled Ee of 
Positively reinforced and suppression of negatively Sng responding by 
Preappetitive stimuli (e.£., Leitenberg, 1966; Rescorla an fl omon, 1967). But, 
aside from the problems of distinguishing such enn লা FE EEN 
conditioning (e.g., autoshaped key-pecks) and of clear y I Fn between 
Positive and negative reinforcement (cf. chapter 5), Eb EE RG 
and preappetitive stimuli showed that such an account ber টি B OGG. Ble Ee 
tion (e.g., Kelleher, Riddle, and Cook, 1963; Azrin an ake, ; Blackman, 
1977). at shock level and baseline response rate 


For example, 


forced and ¢ 


Figure 9-8 shows th 
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jointly determine whether preaversive stimuli suppress or enhance a rat’s lever 
pressing reinforced with food (Blackman, 19682). In the presence of red light and 
noise, lever presses were reinforced according to a DRL 15-sec schedule with LH 
5-sec (i.e., a press was reinforced only if emitted within 15 to 20 sec of the last 
press); in the presence of white light and no noise, lever presses were reinforced 
according to FI 20-sec with LH 5-sec. In these multiple DRL FI schedules, the DRL 
component maintained about 5.5 responses per minute and the FI component 
maintained about 32 responses per minute. Later, occasional one-minute tones that 
preceded brief shock were added; shock level was varied to determine the relation 
between shock magnitude and degree of Suppression. Figure 9-8 (left) shows 


absolute rate of tesponding in tone (the preaversive stimulus) as a function of shock 
level. In the FI component, response rate consistent 


shock level. In the DRL component, however, respon: 


levels and decreased Only at higher shock levels. Figu 
data converted into a Suppression ratio 


ly decreased with increasing 
Se rate increased at low shock 


» the change in response rate expressed 


may vary considerably de 
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Figure 9-8. Absolute rate of responding (left) and suppression ratio (right) in the 
Presence of a preaversive stimulus as a function of the level of shock that followed 
the preaversive stimulus. The data shown are from one of three rats whose lever 
pressing was maintained by multiple DRL FI schedules of food reinforcement. With 
FI responding, the preaversive stimulus suppressed. responding at all shock levels; 
with DRL responding, however, the preaversive stimulus enhanced responding at 
low shock levels and suppressed it only at higher shock levels. (Adapted from 


Blackman, 19683, Figure 2.) 
preaversive stimuli occurred more rapidly the lower 


he higher the baseline reinforcement rate. 
muli are determined not only by properties of 


baseline performance, but also by properties of the schedule of stimulus presenta- 
tion. For example, suppression varies with both the duration of a preaversive 


stimulus and how often the stimulus is presented within a session. The degree of 
Suppression may depend in part on how much the reduced rate of responding 
affects rate of reinforcement: less suppression occurs if the reduction in responding 
Breatly reduces the number of reinforcers in the session, and more suppression 
Occurs if the reduction in responding only slightly affects the number of reinforcers 
(Stein, Sidman, and Brady, 1958; see also Meltzer and Brahlek, 1970; J. B. Smith, 
1974). As with the simpler instances of respondent conditioning earlier, the BHfEcls 

n operant behavior depend not simply on 


Of preaversive and preappetitive stimuli 0 I s 
the pairings with Le or appetitive stimuli, but rather on the differential 


Correlation of these stimuli. The point is illustrated in Figure 9-10 (See also Figure 
9.3), which shows suppression of 8 rat’s reinforced lever pressing produced by 
various combinations of shock probabilities in the presence and absence of 


= i d Mclntire, 1969). F 
is Affe 1968; see also Davis an ্ For 
Preaversive stimuli (Rescorla, 5 ens presentations are pat ES 


example, i reaversive 

CA or ns CS = 0.40), a range of effects from complete 

SUpptesion t uppression at all can be obtained, depending on the probability 

Sia n to no Supp sive stimulus is absent. (The differential correlations are 
ock when the preave e value: the preaversive stimulus is said 
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predictive value when shock is as probable in its 
Presence as in its absence.) 

The phenomena Occurring when one stimulus signals another stimulus are 
Phenomena of stimulus control: responding in the 
stimulus is affected by its relation to the stimulus 
Summarize these effects in any sil 
respondent conditioning, i 
substitute for the other, b 
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p(Shock/no CS) 
Figure 9-10. Suppression in the presence of a preaversive stimulus (CS) with 
different shock probabilities in the presence and absence of the preaversive 


stimulus. For example, when the probability of shock after the preaversive stimulus 
| of rats’ lever pressing depended 


was 0.4, or pP(SHOCKICS) = 0.4, the suppression Lr ট 
on the probability of shock in the absence of the preaversive stimulus, or 
P(SHOCK/no CS). Effects ranged from complete suppression when the latter 
Probability was zero, to no suppression when the probability was equal to 
P(SHOCK/CS). Note that the same data are plotted in both halves of the figure: 
P(SHOCK/CS) is the parameter on the left, and p(SHOCK/no CS) is the parameter 
On the right. In this suppression ratio, baseline equals 0.5. (Adapted from Rescorla, 
1968, Figure 3.) 

to say that its aversiveness comes about 
to say that this aversiveness is necessary 


g. We are perhaps still a long way from 


and Fleshler, 1962), it still is one thing 
through respondent processes and another 
to the acquisition of avoidance respondin 
resolving such issues. 


RECAPITULATION: A BEHAVIOR 
PARADIGM 


We have considered a variety of Behe 
elicitation, reinforcement, discrimination, 


We apply these concepts to more complex bela J 
review seems appropriate. We now consider a paradigm in terms of which many 


relations in the preceding chapters can be summarized. A AE is a model, a 
Braphic or symbolic representation of relations among ৰ EG Ours 
Will represent possible relations among stimuli oe So on: er ট AE 
Provide a more formal organization for some on P রর EA ore TE 
The form of the paradigm is: SD(R:SC), where SD is a discrt oy h i stimulus, R 
is a response, and SC is a contingent Stimulus. The in the parenthesis 
represent the relation between R and SC; the parent hs et EO 
the effect of a response on the probability of the stimusus. Se 1 0 TRS 
therefore, is not necessarily a consequence of the To 0 ৰ t be a 
Stimulus prevented by the response, 4s WhED: E03 by Ss shock in 


1 procedures and findings: phenomena of 
and conditioning, among others. Before 
types of behavior, such as language, a 
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avoidance). A contingent stimulus might be a stimulus delivered in a stimulus- 
presentation operation (in this Case, the response has no effect on stimulus 
probability), or it might be a stimulus in the consequential operations of reinforce- 
ment or punishment (in these cases, the Tesponse raises or lowers stimulus probability). 

The following development builds the paradigm from its components. 
Consider first the simplest experimental operation beyond mere Observation, 
stimulus-presentation (chapter 3). We represent this operation as: (SC). No signal 
Or warning stimulus precedes the contingent stimulus and no response need occur 


nted. Suppose we Simply present food pellets 
dently of behavior. Each food-pellet delivery 
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To recapitulate, the several operations may be represented as follows: 


(SC) Stimulus-presentation operation 
(R:SC) Consequential operation 
SD(SC) Stimulus-control operation superimposed on 


stimulus-presentation operation 
SD(R:SC) Stimulus-control operation superimposed on 
consequential operation. 
ssion of experimental operations in chapter 2; 
perimental examples presented there. 

In examining any experimental procedure in the analysis of behavior, it is 
often a useful exercise to describe the procedure (or, if it has separate stages, its 
Various components) in terms of the appropriate operations. Sometimes a proper 
description requires a combination of operations. For example, assume that a rat’s 
lever presses in light produce tone, and its lever presses in tone produce food 
pellets. Both stages can be represented as SD(R:SC), a stimulus-control operation 
superimposed on a consequential operation. The response R, the lever press, is 
common to both stages, but different stimuli serve as SD and as SC in each stage. In 
the first stage, light is SD and tone is SC; in the second stage, tone is SD and food is 
SC. The tone illustrates that a stimulus that serves 85 # contingent stimulus in one 
part of a procedure may serve a5 4 discriminative stimulus in another part of the 


Procedure (chained schedules: chapter 8). 


This organization parallels the discu 
the reader may wish to review the ex 


Kinds of contingent stimuli 
muli in terms of their relations to responses, 


t’s food pellet as an example of a contingent 
and the nature of the stimulus can 


We have defined contingent sti 
and the preceding account used the ra 


i imuli i ts 
Stimulus. stimuli are of various sorts, : \ 
eG effects. In each of the preceding cases, We could substitute electric 


shock for the rat’s food pellet. We could present the shock independently of the 
Ne r the 56); could present it only after some response, such as a lever 
Press (RSC); না coiild present it only in the presence of some other stimulus, 


sent it after some response, such as 
5 R D(SC); or we could present ’ 
UCch as a warning tone, S (SC); occurred in the presence of some other 
a lever press, but only i 


f this response a 
stimulus, such as a warning tone, SD(R:SC). UE borates তৰ behavior in 
these cases to be the same as that with food Br i ্ distinguish Et 

It is convenient, for Some purposes, Ee Fe etimes called aopelitie EE) 
types of contingent stimuti. There are stimuli, Calis: OU 2 ন Fo: - 
ing, or reinforcing, that organisms a Toney e 
Presence of: food, entertaining companys i 


among many others. There are 
oth ‘nes called aversive, noxious, Or punishing, that organisms will 
er stimuli, sometime iC S| 
: electric 
Work to remove or to stay away EL 


hock, dull company, extremes of 
h others: In both cases, some stimuli have obvious 
De or cold, among many thers appear to have acquired their significance during 

iological significance while © n that it is difficult to classify any stimulus 
the organism’s lifetime. nisher (chapters 4 and 5). And, having 
unambiguously as a rein f contingent stimuli, we must 
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admitted appetitive and avers 
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also recognize that relatively neutral or insignificant stimuli may enter into 
contingencies. If the rat lifts its paw and touches the chamber wall, for example, 
this movement is a response, and contact with the wall is its consequence. 
Obviously, no stimulus is likely to be completely without significance, and we must 
recognize that these classes represent points or regions in a continuous range of 
stimulus types rather than three discrete categories, and that the designation of 
particular stimuli can change as a result of such Operations as deprivation. 


Kinds of contingencies 


A response may lower as well as raise the probability with which events 
occur. Suppose that shock is delivered to a rat at the end of every 10-second period 
unless the rat presses a lever. The probability of shock is 1.0 at the end of 10 
seconds if the rat has not Pressed the lever, but it is zero if the rat has pressed 
the lever; the lever press in any l0-second period reduces shock probability from 
1.0 to zero. Because a contingency is defined as the effect of a response on the prob- 
ability of a stimulus, we may distinguish among Contingencies in terms of whether 
the effect is an increase or a decrease in probability. 

Consider some exam 


shock, the stimulus is aversive 


a rat’s lever presses may 


t 
will be delivered. EE RUUD ood Pet 


যর i in EE are, of course, not limited to all-or-none cases, +1.0 
on i examp. €, 2 rat’s lever press may produce a food pellet only one-tenth 
€ time (e.g., ratio schedules: chapter 8). In this Case, the response raises 

m 0 to 0.1. Similarly, a rat’s lever press may prevent 


intervals. In this instance, the 
eS from 1.0 to only 0.5. i 


» but this means that shocks 

5 if a response Occurs, it is not 
inulis 5 ion between respo stim- 

Stimuli; we are interested in the succession of re. POnsesing 


represent the environmental 
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In classifying contingent stimuli, we recognized relatively neutral or insignifi- 
cant stimuli as well as appetitive or aversive stimuli. The same consideration is 
appropriate in the case of contingencies. The limiting case of a contingency is one 
in which a response has no effect on stimulus probability. For example, we may 
deliver food or shock to a rat independent of its lever presses. These are simple 
stimulus-presentation operations, but the notation (R:SC) emphasizes the relation 
between stimulus-presentation operations and consequential operations. Thus, 
(R:SC) could represent the response-independent delivery of appetitive stimuli such 
as food pellets, or aversive stimuli such as electric shocks, or relatively neutral or 


insignificant stimuli such as lights or tones. 


Classification of contingencies 

and contingent stimuli 

But a limitation to the paradigm must be recognized. In some contingencies, 
it is difficult to decide whether the change involves presentation or removal of a 
Stimulus. For example, is the response-produced change from a cold to a 
comfortable environment to be counted as positive because heat has been added 
to the environment, or as negative because the sensory effects of cold have been 
terminated? Contingencies involve changes in the environment, and we must 
recognize that there will be ambiguous Ca: 
consider whether the distinction between positive and negative reinforcement or 
between positive and negative punishment is worth maintaining. The important 
distinction is that reinforcement increases responding whereas punishment 
decreases it; whether these increases Or decreases come about through presenting 
Stimuli or taking them away is perhaps less critical (cf. chapter 5). Kk 

Kinds of contingencies and contingent stimuli are SES in SULT 9-1, 
in which they are shown combined with a discriminative stimu’us, - These 
instances are shown with some representative names applied to them in the 
Psychology of learning. The specific procedures are hE SEN of 
names is therefore incomplete. For any procedure in the pyc ন 5] US 
it is instructive to locate it or its various Stages ONS the Classen T6970 YT, 


Relations among responses 

An important property of a contin 
Occasions. We must distinguish the response 
responses occasioned by the contingent stim", 
‘Involved production of food pellets by a rats 
Specified in the contingency; other responses Suc 
chewing, salivating, and swallowing are OccHSIONS 
effects of contingencies may depend in part on the rel 
Classes. 


gent stimulus is the behavior that it 
s involved in contingencies from the 
lus. For example, if the contingencies 

lever presses, the lever press is 
h as taking the food in the mouth, 
d or elicited by the pellet. The 
ation between these response 


lative probabilities of these responses in situa- 
erested in the responses of running 
ch the rat engaged in when both a running wheel 


ilable. We may expect an opportunity to run to 
* bable than eating, but not otherwise. 


It 


First, we may examine the ৰ 
If we were int 


tions in which both are possible. 
and of eating, we might see whi 
and food pellets were freely ava 
reinforce eating if running is more PIO 


TABLE 9-1 
Kinds of contingencies and contingent stimuli. Entries are representative classes of 
experimental procedures; those in parentheses illustrate cases in which a discrimina- 
tive stimulus is superimposed on the contingency. 


Type of contingent j 
StHIrUIGs Type of contingency 

Response raises Response does not Response lowers 
probability of affect probability probability of 
stimulus of stimulus stimulus 

Appetitive, rewarding, Positive Stimulus Negative 

or reinforcing reinforcement presentation punishment 
(operant (respondent (omission 
discrimination) conditioning) training) 

Relatively neutral, Sensory Stimulus Sensory 

Or insignificant Consequences presentation consequences 
(latent learning) (sensory (latent learning) 

preconditioning) 

Aversive, noxious, Positive Stimulus Negative 

Or punishing punishment presentation reinforcement 
(discriminated (defensive (discriminated 
punishment) conditioning) avoidance) 

Conversely, 
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is the simplest operation, (SC). But, because responses can have environmental 
effects, we must also examine consequential operations, which are specified in 
terms of contingencies, (R:SC). A contingency is the effect of a response on the 
Probability of a stimulus. In addition, either stimulus-presentation operations or 
Consequential operations can be signalled, i.e., can take place in the presence of 
particular stimuli. Thus, we must also be concerned with stimulus-control 
Operations, whether these are superimposed on stimulus-presentation operations, 
SD(SC), or on consequential operations, SP(R :SC). These various operations can 
involve contingent stimuli that are appetitive, aversive, or relatively insignificant. 
Further, the contingencies into which they enter can involve increases in stimulus 
Probability, decreases in stimulus probability, or no change in stimulus probability. 
The last special case of contingency is equivalent to a stimulus-presentation 
Operation: (R:SC) = (SC). 


These classifications do not guarantee that any stimulus or any response will 


have only a single function; a stimulus in a contingent relation with one response 
may be in a discriminative relation with another, and a response elicited by one 
Stimulus may be in contingencies with other stimuli (for example, the mother’s 
Presence may be a contingent stimulus, when she comes at the infant’s cry, and a 
discriminative stimulus, when the infant learns that things happen in the mother’s 
Presence that do not happen in her absence; the infant’s cry may sometimes be 


elicited by events, such as painful stimuli, while at other times it may occur because 
are its consequence). We come to understand 


Such events as the mother’s presence চ 
behavioral situations by separating the various stimulus and response relations that 
enter into them. That is the business of an experimental analysis. In studying 
behavior, we must analyze situations to determine which features of an operation 


Were important and which aspects of responding were affected byit. 
On close examination, some distinctions implied by the paradigm seem to 


diminish in importance. In the analysis of behavior, classifications of events often 
have fuzzy boundaries, and distinctions may then become arbitrary. We noted such 
2 case with respect to presenting Or removing stimuli (e.8., is food effective as a 
Teinforcer by virtue of its presentation or because it terminates stomach 
Contractions or other events that may be correlated with hunger?). In the final 
analysis, we may completely discard the distinction between presenting and 
removing stimuli, recognizing that all consequential effects of responding can be 
Characterized simply as environmental changes. We might then note that every 
Procedure takes place in some environment, and so we might next rid of 
discriminative stimuli, observing 25 we did so that SD can be incorporated into the 
definition of the response. Thus, if a pigeon’s key pecks produce Ea the 
Presence of green but not red light, the response of the OE NG 
to include only those pecks emitted in green, and we can simp TY ff 50 igm to 
(RSC), where R represents key pecks in the presence of green an. represents 
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Our language words have diverse sources. Verbal, through Latin, and word 
through Old English, are derived from a common Indo-European root wer-, 
to speak. The Germanic spek- or sprek-, from which comes the German Jie 
Sprache, speech or language, leads to the English speak and speech. The 
Greek legein, to speak, and logos, word, lead to lexical, legible, and such 
relatives as logic and intelligent. Latin provides language and linguistics 
through lingua, tongue, and vocal and vocabulary, through vox, voice. k 


t our everyday vocabulary of language includes many 
assumptions that stand in the way of a behavioral account. For example, consider 
the common term word. When we speak of words in language, We seldom bother 
to distinguish spoken words from written words. Yet speaking a word is not the 
same as writing it, and the two responses are likely to occur in different circum- 
stances. Even worse, we often speak of using words, as if these were manipulable 


Objects instead of behavioral events. f f 
Particular words are uttered or written in particular circumstances. We do 


not ordinarily say that we utter nouns in the presence of relevant objects, or that 
Wwe utter sentences in the presence of relevant events. Instead, we tend to say that 
Words refer to objects, or that sentences are about events. There is good reason for 
this usage in the everyday vocabulary; as we shall see, it 1s appropriate to the effect 
Of language on a listener or 8 reader. But this vocabulary may be misleading when 
applied to an analysis of a speaker’s or a writer's behavior. 

An analogy may clarify the point (cf. MacCorquodale, 1970). Some ancient 
accounts of vision assumed that vision depended on emanations from the eye 
making contact with the object, rather than on light from an object entering the 
eye. The direction of effect, from eye to object or from object to eye, was critical 
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to subsequent progress in vision and optics. The distinction between the two direc- 
tions is sometimes implied in the vocabulary of vision (e.g., as when we distinguish 
between something seen and something looked at), but at least the vocabulary does 
not prejudice an account of visual phenomena in one direction or the other. 

This is not the case in the vocabulary of language. We Speak of language as if 
it were directed toward environmental objects or events: we Say that words or 
sentences refer to, deal with, speak of, call attention to, or are about things. The 
language of reference implicitly includes the direction from verbal behavior to 
environment. A vocabulary emphasizing the opposite direction is not obviously 
available in the everyday language of verbal behavior. We must therefore consider 
the possibility that the everyday vocabulary of language has prejudiced the ways in 
which we can analyze verbal behavior. 

Another part of the everyday vocabulary that complicates the task of 
analyzing verbal behavior is the language of meaning. We say that utterances convey 
meaning. But dictionaries do not contain meanings of defined words; they just 
contain other words. It is not useful to say that an utterance conveys a meaning 
unless something also can be said about the sources of the meaning. The trouble 
is that meanings are not identifiable as events. In fact, the concept is derived from 
verbal behavior, and it is therefore inappropriate to use it in turn to explain verbal 
behavior. 

Accounts of language in terms of such concepts as meaning are, inevitably, 
theories of language. To be effective, theories must classify the events with which 
they deal. A primary task of the analysis Of language, therefore, is classifying verbal 
behavior. But if meaning and related concepts provide an ambiguous basis for 
classifying verbal behavior, what properties of verbal behavior should take their 
place? 

Grammatical classifications exist, but the grammatical properties of the words 
in a sentence will not help us to distinguish among the circumstances in which a 
sentence might have been generated or the consequences its production might have 
for a speaker, a writer, a listener, Or a reader. For example, a sentence such as “The 
Psychologist talked to the chimpanzee” has the same grammatical properties 
whether uttered by the observer of a human-simian conversation, written as 
memorized by a student taking an examination, overheard by a passing clinician in 
a psychiatric ward, or simply encountered as an arbitrary example in a text. In all 
of these cases, we would class psychologist and chimpanzee as nouns and would 
note the different relation each has to the verb talked; yet the cases are worth 
distinguishing, even though they are grammatically identical. 

Describing the preceding sentence in terms of grammatical properties of its 
components deals with the structure of verbal behavior. But distinctions made on 
the basis of the circumstances in which a sentence can occur are distinctions of 
function. A functional account of verbal behavior is based on how verbal responses 
work: conditions under which they occur and consequences they may have. As 
with nonverbal behavior, we will not have to choose between structural and func- 
tional accounts of language, because the two accounts complement each other. 
Unfortunately, verbal behavior has been controversial in the history of psychology, 
and structural and functional accounts have often been pitted against each other as 
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if they were incompatible instead of complementary (cf. Titchener, 1898; Catania, 
1973c; Segal, 1977). This text attempts to deal consistently with both kinds of 
approaches. The present chapter outlines a functional account of verbal behavior 


(Skinner, 1957). 


FUNCTIONAL PROPERTIES OF 
VERBAL BEHAVIOR 


ed by the occasions on which they occur and the 


consequences they produce. They can be occasioned by EET TETDEOTDODNYETNT 
stimuli, and they can have either verbal or nonverbal consequences. For example, 
a child might say “apple” in the presence of either the written word or an actual 
apple; as a consequence of saying “apple,” the child might be given EIU the veal 
reply, “right,” or the apple. The following account argues for the importance of 
these kinds of distinctions. First, we consider some relatively simple formal 
relations, in which verbal behavior is reproduced or imitated in either the vocal or 
the written mode (e.g., as in echoing what someone has Said); note the distinction 
between the general term, verbal, and the term specific to spoken EE ord 
These cases will be followed by a treatment of verbal Bele OUT CET ECON ন 
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the stimulus, but these movements are not themselves sounds. The properties of 
the heard sound pattern and the movements of the vocal apparatus that echo that 
sound pattern are only indirectly related. How does the child who hears sounds like 
“mama” or “dada” come to make precisely those vocal responses heard as the same 
sounds by the parents? 

The speaker’s articulation of sounds, the physical properties of the acoustic 
stimulus, and the listener’s response to these sounds are intricately related. For 
example, many consonants of English (p, b, t, d) cannot even be produced unless 
they are accompanied by a vowel, and their physical properties vary as a function 
of the context of sounds within which they occur (e.g., Lane, 1965; Liberman, 
1970). The capacity for echoic behavior must depend in some way on the shaping 
of vocal apparatus movements by the acoustic consequences of vocalization. To 
the extent that these consequences engender and maintain vocalization, we may say 
that these consequences are reinforcing, but we must recall that the concept of 
reinforcement describes and does not explain. Thus, we may regard native-language 
speech sounds as reinforcing relative to nonnative-language speech sounds, because 
native-language sounds are ordinarily maintained in an infant’s spontaneous 
vocalizations whereas nonnative-language sounds gradually disappear. Native- 
language speech sounds are presumably discriminated and perhaps acquire their 
reinforcing properties because they often accompany the activities of important 
Caregivers in the infant’s environment (Eimas, Siqueland, Jusczyk, and Vigorito, 
1971; Trehub and Chang, 1977). 

Echoic behavior does not constitute the major part of human verbal behavior.’ 
Nevertheless, echoic behavior does not simply accompany the acquisition of human 
language and then vanish; rather, it persists in the behavior of mature speakers. For 
example, we may repeat a number given to us by a telephone operator, or the name 
of someone who has just been introduced to us. The acoustic correspondence of 
the vocal stimulus and the vocal response characterizes this class of verbal behavior. 

The unit of echoic behavior may vary in size from individual speech sounds, 
or phonemes, to extended phrases or sentences. For example, the echoic produc- 
tion of individual speech sounds may be important in initially learning to 
pronounce words in an unfamiliar language, or in generating rhymes or alliteration 
in poetry (B. H. Smith, 1968; Skinner, 1972). On the other hand, the echoic 
production of extended phrases or sentences may be important in dramatics, as 
when an actor repeats the lines whispered by a prompter, or on ritual occasions, as 
when a bride and groom repeat the phrases of a marriage vow spoken by a minister. 
The unit of verbal responding is not defined by its size; rather, the unit is defined 
by the functional relation between stimuli and responses, and is demonstrated when 
the verbal response can be shown to occur in verbal behavior independently of 
other verbal units. 

Transcription behavior. Verbal stimuli can also correspond to verbal 
responses when both the stimuli and the responses are written. In such Cases, the 


1Echoic responding occurs naturally among some nonhuman organisms, such as parrots 
and myna birds (cf. Mowrer, 1950), but the units of echoic behavior in animals are presumably 
different from those of human speakers. 
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behavior is called transcription. For example, we may copy a number from the 
telephone book, or copy an author and title in preparing a bibliography. But, as in 
echoic behavior, the movements in producing a word may not be related in any 
Simple way to the properties of the word as a stimulus; writing a word is different 
from looking at a word. The correspondence need not even be in physical 
Properties of the stimuli and responses. In the form of individual letters, a type- 
Written text may have little in common with a handwritten copy. Nevertheless, we 
can ordinarily judge whether a transcription is accurate in spelling, word order or 
Punctuation. 

Just as the unit of echoic behavior may vary from individual phonemes to 
entire phrases or sentences, the unit of transcription also may vary from individual 
characters to extended passages, depending on the circumstances in which the 
behavior occurs. A child learns to copy individual letters before learning to copy 
entire words. In doing so, the child learns the correspondences between arbitrary 
Visual forms, such as the printed and script a in upper and lower case. We may be 
Unable to provide unique physical specifications common to the letter a in its 
Various forms and to no other letters, but the problem is not different in kind from 
that of specifying the unique acoustic properties of a given phoneme as it occurs 
in different contexts (cf. Gibson, 1965). ef ( } 

It is important to distinguish transcription from copying in the pictorial 
Sense. A skilled Asian calligrapher may be able to produce an accurate and readable 
Copy of an alphabetic text even though unfamiliar with the European language in 
Which the text is written, but it would be inappropriate to speak of such copying 
as verbal. The distinction is based on the behavioral units in the two kinds of copy- 
ing. The critical features of the calligrapher’s copying are the geometrical properties 
Of the letters in the text and the marks produced by the calligrapher s strokes. The 
Critical features of transcription, however, are the verbal units, such as letters, 
Words an i isinal text and its copy. 4 

লক ("সনত vocal and written modes, echoic behavior and 
transcription behavior are formally similar. It is tempting AS assume, perhaps 

Ccause of the relative ease with which it is learned, that echoic behavior is simpler 
than transcription. Children ordinarily acquire echoic behavior early, even without 
Specific instruction, but they do not ordinarily acquire CNG oT Sp 
explicitly taught. Acquiring transcription behavior may also involve other kinds 0 
tesponses, as when a child is taught to say a word aloud while copying 

Pure transcription, in the sense of transcription CE A 
responses to the text (such as vocal or subvocal responses), proba Y SE 
rarely. A skilled typist, for example, may sometimes transcribe a tex i 
tesponding verbally to the text in other ways (as when listening to 8 co hh 
dlsewhere in the office); in such circumstances, the typist is unable to repo e 
text content even though it was accurately transcribed. A particular stimulus may 
Set the occasion for a variety of responses, and a particular response may be 
Occasioned by a variety of stimuli. The integrity of this functional classification is 
Not diminished by the possibility that transcription is sometimes accompanied by 
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Textual behavior. When a written verbal stimulus sets the occasion for a 
corresponding vocal verbal response, the behavior is textual. Thus, we may say 
aloud a number we have found in a telephone book, or read a bedtime story to a 
child. In textual behavior, the arbitrary correspondence between verbal stimuli and 
verbal responses is more obvious than in either echoic or transcription behavior, 
because the stimuli and the responses are in different modes. As with transcription, 
this behavior is usually taught explicitly, and some controversies in such teaching 
are based on assumptions about the behavioral unit appropriate to various stages of 
instruction (e.g., whether, in teaching reading, the teacher should begin with 
individual letters, syllables, or entire words: cf. Gleitman and Rozin, 1973). 

As with the other formal classes, textual behavior must be distinguished from 
other kinds of responses to written verbal stimuli. For example, if a sign says 
“Stop,” there is a difference between reading the word aloud and stopping; only 
the former is textual. In the mature reader, textual responses become less important 
than other kinds of responses to written verbal stimuli. Vocal responses diminish 
in magnitude, become subvocal, and perhaps disappear completely as a child 
becomes a proficient reader. Reading is behavior, but textual responses, as defined 
here, are not equivalent to reading. For example, a parent, while reading a bedtime 
story to a child, may discover that the most recent page, read without attention, 
cannot be remembered; yet the child’s reaction may indicate that the page had been 
read without any serious errors or omissions. This example indicates some problems 
in the colloquial vocabulary, which does not distinguish between reading that is 
simply the saying of the words on a page and the kind of behavior we call reading 
for understanding. Some advantages of a behavioral analysis come from making 
such distinctions. Textual behavior is defined in terms of vocal responses to visual 
verbal stimuli; these vocal responses may eventually diminish, as when a child’s lips 
move in silent reading, but they continue to be textual responses. Because reading 
for understanding must include other behavior along with or instead of vocal or 
subvocal speech, it is more than simply textual behavior. 

Dictation behavior. Just as a written verbal stimulus can set the occasion for 
a vocal verbal response, a vocal verbal stimulus can set the occasion for a written 
verbal response. This class of verbal behavior is called dictation (here we emphasize 
the behavior of the listener, who takes the dictation, rather than that of the 
speaker, who dictates). For example, we may write a number spoken to us by a 
telephone operator, or take notes at a lecture. The units of dictation are typically 
entire words or phrases, but individual letters may also serve as units (e.g., as when 
children are taught the written alphabet, or as when an unusual name is spelled out 
for a stenographer). 

As with textual behavior, dictation involves stimuli and responses in two 
different modes. Some of its unique Properties follow simply from the relatively 
permanent record, in the written text, produced by dictation. In addition, 
occasions for dictation are limited relative to textual behavior, because writing 
implements, unlike the vocal apparatus, are not parts of the human anatomy. 
Perhaps for this reason, there is little temptation to pursue the possible covert 
manifestations of dictation behavior; we are less likely to speak of submanual 
writing or typing than of subvocal reading (the motor theory of consciousness had 
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argued that thought was merely language behavior, like speech, that had been 
reduced in magnitude: cf. Max, 1934). Nevertheless, textual behavior and dictation 
behavior are formally similar, and either can be accompanied by other kinds of 
Verbal responses occasioned by verbal stimuli. 

Relations among the formal classes. This account of formal verbal classes has 
been limited to vocal and written stimuli and responses. It could have been 
extended to other language modes (such as sending and receiving Morse code, or 
Writing and reading Braille). It is not critical, however, to provide an exhaustive list 
of all possible formal verbal relations. The present classes are important because 
they emphasize distinctions in terms of verbal stimuli and verbal responses, and 
these distinctions are relevant to the acquisition of language. We speak of letters 
And words without regard to whether they are written or spoken, but letters and 
Words function differently as stimuli and as responses in these modes. For example, 
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Figure 10-1. Relations among the four formal classes, in which 5 = stimulus and 
R = response. Similar relations can be established for any other pair of verbal 


modalities (e.g., spoken verbal behavior and the gestural verbal behavior of 
American Sign Language). 


written languages are involved. Language instruction usually recognizes these 
distinctions; a course in conversational French is expected to emphasize the vocal 
mode, whereas one in scientific Russian is expected to emphasize the written mode.) 

The formal classes illustrate the importance of distinguishing between verbal 
stimuli and verbal responses. All the examples have been variations on the same 
theme. Distinguishing between stimuli and responses presents little difficulty in the 
analysis of nonverbal behavior. If a rat presses a lever only when a light is present, 
Wwe call the light a stimulus and the lever press a response; we are not tempted to 
reverse the terms. But in verbal behavior, the speaker’s response is the listener’s 
stimulus and the writer’s response is the reader’s stimulus, and confusion comes 


easily. Thus, we must characterize the stimulus and response components of various 
verbal relations with care. 


Tacts: The relation between 
language and the environment 


A tact is a verbal response occasioned by a nonverbal stimulus. For example, 
if a child learns to say “apple” in the presence of an apple, the child is said to be 
tacting the apple. The tact is an instance of stimulus control; a particular stimulus 
sets the occasion for a particular response. Through this relation, verbal behavior 
makes contact with the nonverbal environment. Tacting has much in common with 
naming, but it is distinguished from naming by the presence of the tacted stimulus. 
We may speak of naming an object that is absent, but we could not then say that 
the object had been tacted. 

An indeterminate variety of tacts is available to the mature speaker. We tact 
objects (chairs and tables, pencils and books), living things (flowers and trees, birds 
and insects), weather conditions (rain and snow, sun and clouds), activities (walking 
and running, working and playing), and innumerable other features of the environ- 
ment. Some tacts are general (e.g., man, woman), whereas others are restricted to 
relatively narrow circumstances (e.g., the name of a particular individual). The 
wealth of available tacts may be taken as a remarkable feature of the human 
capacity for language, but this wealth should not obscure the relations that define 
any instance of tacting. It is useful to consider how tact behavior might be 
established in nonhuman organisms; such examples force us to be explicit about the 
components that must enter into such behavior. 
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Let us imagine a food-deprived pigeon in a chamber with a window on which 
different colors can be projected. Beneath the window are three keys. When the 
window is lit red, pecks on the left key produce food; when the window is lit blue, 
pecks on the middle key produce food; and when the window is lit green, pecks on 
the right key produce food. In such circumstances, the pigeon will eventually peck 
the left key during red, the middle key during blue, and the right key during green. 
Is there any reason why the pigeon’s performance should not be called the tacting 
of red, blue, and green? 

It first might be argued that the stimuli are highly specific; the pigeon would 
not be expected to respond to red apples and red shirts and red sunsets as it 
responds to the red light in the window. We have already noted that the generality 
Or specificity of the controlling stimuli does not define the tact relation. If we are 
in doubt about the generality of the pigeon’s response to red, we may simply call 
the pigeon’s peck on the left key a tact of a red-lit window in this particular 
chamber; in effect, we may treat the left-key peck as the pigeon’s name for this 
Stimulus. 

What of the consequences of the pigeon’s pecks? If the pigeon were not food- 
deprived or if food were not delivered as a consequence, the pigeon would stop 
pecking. Even human tacting is not ordinarily without consequences. We do not 
move about our environments naming everything we see, but we tact when tacting 
has consequences. The consequences do not define the tact, but tacting remains 
tacting whether maintained by approval, pay, an examination grade, or simply 
because it is helpful to the individual to whom we are speaking. 

Finally, we may object that the pigeon has no audience. Although the 
Pigeon’s pecks may be appropriately occasioned by red and blue and green lights, 
the pigeon is not speaking to anyone. This difficulty could easily be rectified, by 
arranging signs, “red,” “blue” and “green,” that respectively lit up after the 
Pigeon’s pecks on the left, middle or ight key. If we knew our pigeon’s 
Performance was accurate and we could not look into the chamber, we could 
simply watch the signs and let the pigeon tell us which light was on. The pigeon 
Would be speaking in the language of pecks, but its verbal responses would nonethe- 
less be tacts. 

If we utter a word in the presence of an object, we do not reject this as 
naming if no one has been listening. An audience may be important in establishing 
Or maintaining tacts, but it does not define the tact relation. The tact is defined by 
the relation between stimuli and the verbal responses occasioned by them. The 
likelihood of tacting may be modified by audience variables and consequences, 
but these variables are not criteria for whether a tact has occurred. Although the 
Pigeon’s pecks in the presence of red and blue and green may be relatively trivial 
instances of verbal behavior, they are legitimate illustrations of the tact relation. 

Abstraction. We have emphasized the control of tacts by relatively simple 
environmental events. Strictly, we should say that tacts are controlled by properties 
of the environment rather than by particular stimuli or classes of stimuli. The 
Property of color, for example, is the critical determinant of the verbal response, 

red,” whether that response is occasioned by red fire engines, red traffic lights, or 
ted cheeks. This verbal discrimination based on a single property of a stimulus is 
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called an abstraction. The property is defined by the practices of the verbal 
community; it does not depend on our capacity to provide some independent 
physical measure. For example, no range or distribution of wavelengths can be 
established such that all visual stimuli within the range are called red whereas all 
outside the range are not. Our ability independently to specify some measurable 
physical dimension of the controlling stimulus is not essential to identifying a 
particular verbal relation as a tact. 

Experiments on concept formation may be regarded as demonstrating the 
acquisition of tacts (cf. chapter 7 on concepts). An example is an experiment 
(Hull, 1920) in which learners had to master names for each member of several 
sets of Chinese characters (three sets are shown in Figure 10-2). A given name 
was consistently related to a particular radical that appeared in one character in 
each set, but the position and size of the radical and the configuration within which 
it appeared varied from one set to another. Each set of twelve characters was 
presented until the learner could give the appropriate name for each, and then a 
new set of twelve was presented. By the fifth set, learners were able to name more 
than half the characters upon seeing them for the first time, and sometimes they 
were able to give the name even though they could not sketch the radical or 
describe the basis of their naming. 

Hull’s experiment may be regarded as concerned with the acquisition of 
tacting. Tacting in that situation differed from tacting in natural languages in that 
the basis for many of our tacts cannot be so explicitly defined. For example, we 
cannot say exactly what properties make an object a chair. A chair may have four 
legs or stand on a single pedestal, it may have a flat or a contoured seat or back, and 
it may be constructed from many different materials. We cannot even appeal to its 


function, because we call some objects that cannot be sat upon chairs (e.g., a toy 
chair in a set of doll-house furniture). 


Radical Characters Lists 
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Figure 10-2. Radicals and their appearance as parts of Chinese characters in six 
different lists. Learners had to give the radical name to the character that contained 
it, but characters were not repeated in successive lists. Learners became able to 
name characters appropriately upon seeing them for the first time in new lists, and 
sometimes were able to do so even when they were unable to define the common 
radical by sketching it. (From Hull, 1920, Figure 1.) 


Verbal Behavior 239 


Extremely subtle properties can control tacts. These properties may include 
relations among stimuli. Such terms as above and below, near and far, or larger and 
smaller tact properties of stimuli in relation to each other or to the speaker. 
Relational tacting occurs when we say that two objects are alike or are different, 
Or when we note that one item in a set is an odd item. Such terms rarely stand 
alone, and we will consider their joint dependence on the relational properties of 
events and on other verbal responses when we later consider the verbal relations 
called autoclitic. We also may tact complex events extended in space or time, as 
When we discriminate among different authors on the basis of their writing styles, 
Or as when we identify a musical piece as one by Debussy or a painting as one by 
Monet. At another level of complexity, we might say that Debussy and Monet have 
Something in common, even though it would be difficult to specify common dimen- 
sions of Debussy’s music and Monet’s paintings. Yet to call them both impressionists 
may be regarded as a tact of common properties. 

Sometimes the properties controlling a tact can be identified more with the 
Speaker’s behavior than with any particular feature of the environment. For 
example, if a painting or a musical composition or a person occasions the tact, 
“beautiful,” this tact presumably depends on the common responses generated in 
the speaker rather than on any particular physical properties common to these 
Stimuli. An interesting verbal case is the “tip-of-the-tongue” phenomenon. When 
We say a word is on the tip of our tongue, We are tacting the near-threshold 
availability of an appropriate verbal response in our own verbal behavior. 
Sometimes we can even report properties of the unrecalled word, such as its length 
Or part of its spelling (R. Brown and McNeill, 1966). Kk 

The vocabulary of emotion is similarly based on complex relations among 
Stimuli, situations, and the speaker. Tacts of love, hate, joy and sorrow, whether in 
Oneself or in others, are based not simply on overt manifestations such as laughter 
OF tears, but also of the various circumstances that generate the observed behavior. 
If this were not the case, it would be hard to imagine how a verbal community 
could maintain any consistency in this vocabulary; its variability is itself evidence 
Of the subtlety of the relations tacted. B } 

Any event or situation obviously may have many properties that might be 
tacted. Whether any properties are tacted and which properties are tacted will 
depend on other variables that may act upon the speaker. For example, we may 
tact the color of an apple in one circumstance and its smell in another. The 
Situation presents no difficulties; verbal responses may be determined in multiple 
Ways. An obvious case is when the tact is also a response to a question (e.g., “What 
Color is this?” “What does it smell like?”). 

Consider again the analogous case for the pigeon. We might alter the lights 
behind the windows in the chamber so that the colors appeared bright, moderate, 
Or dim. In the presence of a tone, we could then arrange that regardless of color 
left-key pecks produced food in the presence of bright light, middle-key pecks 
Produced food in the presence of moderate light, and right-key pecks produced 
food in the presence of dim light. If the pigeon’s pecks remain appropriate to color 
in the absence of tone, but become appropriate to intensity in the presence of tone, 
We might say that the pigeon was tacting color in the absence of tone and intensity 


240 Verbal Behavior 


in the presence of tone. In the vocabulary of behavior, this is a conditional 
discrimination. The analogous case in human verbal behavior is when a question 
serves as a conditional stimulus (corresponding to tone in the preceding example) 
that sets the occasion for tacting a particular Property of the many properties of 
a stimulus (e.g., the color or smell of an apple). The audience, previous verbal 
behavior, and a variety of other factors may affect tacting. In other words, tacting, 
like other kinds of behavior, is a class of responses that may come under the control 
of environmental conditions. We do not tact indiscriminately. We tact some things 
in some circumstances and other things in other circumstances; we also learn that 
in some circumstances (e.g., remarking on someone’s bad breath or dandruff), it 
may not be tactful to tact at all. 

We can also tact the temporal dimensions of stimuli, as when we say that 
something lasted a long or a short time. Often we respond verbally to stimuli that 
are no longer present, but we may best regard tacts as responses occurring in the 
presence of or very shortly after the environmental events occasioning them; 
responses occurring long after such events require special treatment because they 
typically include other behavior besides tacting (see chapter 13 on memory). Never- 
theless, environmental events have important temporal dimensions, and these 
temporal dimensions can be discriminated. Sentences in different tenses may be 
regarded as classes of tacts controlled by temporal properties of the environment. 
We may say, “It is raining,” “It is beginning to rain,” or “It just stopped raining.” 
Each sentence is a different tact; all are controlled by the event, rain, but they 
ae distinguished by the different temporal properties of that event. A past-tense 
verbal response such as “It rained” may also be uttered under the control of other 
kinds of stimuli, when it is not appropriate to call it a tact. For example, it might 
be uttered as an echoic or a textual response, Or as a tact under the control of 
present stimuli such as wet streets. In Such a case, the response may depend on 
more complex verbal behavior, as when derived from, “The streets are wet; 
therefore it must have rained.” 

The extended tact. The tact is a flexible relation. In some verbal communi- 
ties, the stimulus properties controlling a tact may be sharply defined. A student in 
a science laboratory, for example, is taught to be consistent in tacting particular 
apparatuses and procedures. This precision is less common in everyday discourse. 
We often tact properties of the behavior of our acquaintances, but the conditions 
under which someone is said to be warm or reserved, energetic or lazy, interesting 
or dull, and so on, vary considerably from one speaker to another. The ways in 
which vocabularies have evolved over time are recognized in the etymologies or 
word histories introducing each chapter. 

In the stimulus control of nonverbal behavior, we say a response has 
generalized if a response maintained during one stimulus occurs when some new 
stimulus is presented. For example, if our tacting pigeon pecked the left key when 
a novel amber light was presented, we would say that the response controlled by 
red had generalized to the amber stimulus. A similar generalization of verbal 
responses to new stimuli occurs in the extended tact. Simile and metaphor provide 
familiar instances. We may say that someone is as busy as a bee or as sly as a fox 
(simile) or that someone is a hawk or a dove (metaphor). These extended tacts 


Verbal Behavior 241 


Presumably originated through generalization across some common properties 
Sometimes shared by humans and bees and foxes and birds. It is through metaphor 
that language grows and changes (cf. Esper, 1918; Jaynes, 1976). 

Another type of extension of the tact occurs when new words are formed by 
combining existing words (e.g., the words type and writer predated the invention of 
typing machines). Vocabularies change with changes in the environment important 
to the speakers of a language (but cf. Whorf, 1956). The ways in which tacts can 
be extended in human language are so varied that a detailed account is not feasible. 
Stewart (1975) has offered some interesting examples in his account of the origins 
Of one particular class of tacts, place-names. For example, place-names are more 
likely to be based on unusual than on common features of a region. A valley in a 
forest of fir trees would probably not be named Fir Valley, but it might be named 
Oak Valley if a single oak tree stood there. Similarly, if wolves are common to a 
Particular area, no stream there is likely to be called Wolf Creek, but a stream might 

€ given that name if wolves are rare in the area and one has been sighted there. 

Tacts of private events. Another important instance of the extension of the 
tact is to private events. Some tacted stimuli are accessible only to the speaker, as 
When we say we have a headache. Such tacts depend on the verbal community for 
their establishment and maintenance, but the problem for a behavior analysis is 

OW the verbal community can establish and maintain these responses when it does 
Tot have access to the stimuli. A parent can teach a child color names because the 
Parent can see the colors that the child sees and therefore can respond differentially 
to the chilq’s correct and incorrect color naming. (So many different consequences 

Ollow from color naming that it ordinarily does not matter whether the parent 
teaches the color names through explicit instruction or simply allows the 
Appropriate discriminations to become established through casual day-to-day 
teractions ) With private events, control can be established only through 
Beneralization from tacts based upon events to which the verbal community has 
access, For example, the child may learn to say when it is in pain because the 
Parents have access to such overt manifestations as crying or facial expression, and 
if the chilg has learned the names of body parts, the two kinds of verbal responses 

AY be extended to the tact of pain in a particular place. } 

But the verbal community’s control of these tacts is weak, as demonstrated 
Y the ambiguity of such responses when the overt manifestations are no longer 
Availabe For example, if someone says “J have a headache” and leaves a social 
Batheriny it is not রি? whether that verbal response tacted a private stimulus or 
Ymply allowed the speaker to escape from dull company. Even the language of 
brivate events depends on the public practices of the verbal community. The 
plications have been explored in considerable detail by Skinner and by Wittgen- 
Stein (See esp. Day, 1969, for a discussion of parallels between their treatments of 
Private events). Ee ig: purposes it is sufficient to note that verbal behavior does 
not Ordinarily require that SHimuli be simultaneously available to both Speaker and 
th ener. In fact, some of the important consequences of verbal behavior occur when 
Speaker can tact some event unavailable to the listener. Although the language 

Private events has its own particular difficulties, we need not invoke any new 


ca 2 
SSes of verbal responding to deal with it. 
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Tact relations, by themselves, are only one part of verbal behavior, but 
through these relations verbal behavior comes into contact with the environment. 
Verbal behavior gains its power from these relations because without them there 
would be nothing of which we could speak. Some of what we call truth depends on 
the consistency of relations between verbal behavior and environment. Important 
correspondences between verbal behavior and the environment, however, depend 
not only on the tact relation, but also on the relations of verbal responses to each 
other. 


Intraverbal behavior 


Much of what we learn consists of verbal responses occasioned by other 
verbal responses. We learn to recite the alphabet, to count, to give appropriate 
answers to arithmetic problems, to recite poems, to provide definitions for terms, 
and to state facts. Utterances that seem superficially to have the character of tacts 
are often only verbal responses occasioned by other verbal responses. No one alive 
has seen Caesar at the Rubicon or Napoleon at Waterloo. We might argue that we 
have seen facsimiles, as in paintings, but even if we recognized this painted figure as 
Caesar and that as Napoleon, would we be likely to recognize the setting as the 
Rubicon or Waterloo unless the picture caption told us so? We do not ordinarily 
learn historical details in that way. Instead, given names or dates, we learn to say 
when or in what order events occurred. 

In intraverbal behavior, one verbal stimulus sets the occasion for another 
verbal response. Free association provides a familiar example (Galton, 1879). The 
Speaker, given a verbal stimulus, is asked to produce a verbal response. Because the 
immediate consequences of free associations are usually minimal, and because any 
given verbal stimulus may set the occasion for a variety of different responses, other 
variables may contribute to the particular verbal response given on any occasion. 
The procedure is therefore assumed to tap verbal responses that are of relatively 
high probability in the speaker’s verbal repertory. 


In his discussion of intraverbal behavior, Skinner (1957) treats free associa- 
tion as follows: 


One verbal response supplies the stimulus for 
net effect is revealed in the classical word 
subject is simply asked to respond verbally 
aloud any responses he may “think of”_—that is, find himself making silently. 
Echoic and textual responses are commonly produced but are either 
prevented by instruction or excluded from the results. Such an experiment, 
repeated on many subjects or on one subject many times, produces a fair 
sample of the responses under the control of a standard stimulus in a given 
verbal community. .. . The intraverbal relations in any adult repertoire are 
the result of hundreds of thousands of reinforcements under a great variety 
of inconsistent and often conflicting contingencies. Many different responses 
are brought under the control of a given stimulus word, and many different 
stimulus words are placed in control of a single response. For example, 
educational reinforcement sets up many different intraverbal operants involv- 
ing the cardinal numbers. Four is part of the occasion for five in learning to 


another in a long series. The 
-association experiment. Here the 
to a verbal stimulus, or to report 
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count, for six in learning to count by twos, for one in learning the value of 
T, and so on. On the other hand, many different verbal stimuli come to 
control the response four, e.g., one, two, three. .. or two times two make. ... 
Many different connections between verbal responses and verbal stimuli are 
established when different passages are memorized and different “facts” 
acquired. The word-association experiment shows the results. (Skinner, 
1957, pp. 73-74) 


In its simplest form, intraverbal behavior has been the basis for a substantial 
part of the research on human verbal learning. The classic experiments of 
Ebbinghaus (1885) were specifically concerned with the learning of arbitrary verbal 
combinations. So that the learning would not be contaminated by previous 
language experience, Ebbinghaus created nonsense syllables for his experiments. 
In the current vocabulary, we would say that the experiments were designed to 
eliminate all other possible verbal relations but intraverbal ones. In paired-associates 
learning of nonsense syllables, one nonsense syllable is a stimulus item uniquely 
paired with another nonsense syllable as a response item. The learner’s task is to 
Bive the appropriate response item for each stimulus item. We will not review 
human paired-associates learning here, because it will be treated in Chapter 12. It is 
Sufficient to note that paired-associates learning represents a relatively pure case of 
ntraverbal learning. Hl Hl 

Another variety of intraverbal relation occurs in serial learning, the learning 
Of a list of items in a particular order. Learning the arbitrary order of the letters of 
the alphabet and learning to count are common examples of serial learning. Such 
learning can create extended units of verbal behavior. Much research on intraverbal 
learning was based on the assumption that, in the production of word sequences, 
Successive parts of the utterance served in turn as discriminative stimuli for later 
Parts (cf. chapter 6 on chaining). Although the acquisition of sequential verbal 
responses may involve such relations (as in memorizing a poem or a definition or 
the order of elements in the periodic table), a point may be reached at which it is 
no longer appropriate to speak of each part of the utterance as providing the 
Stimulus for the next part. Intraverbal control may continue as a component of 
extended utterances (it is most obvious if another speaker provides the stimuli, 
As When a prompter assists an actor by whispering the beginning of the next line in 
2 script), but eventually an extended utterance itself may come to function as an 
Independent verbal unit. The development of hierarchically organized, higher-order 
Verbal units from verbal-response sequences is a critical feature of verbal behavior. 
Ust as sequences of letters or phonemes are regarded as units when they occur in 
Words, sequences of words may be regarded as units when they occur as phrases or 
Sentences. Such maxims as “Haste makes waste” and “He who hesitates is lost” 
are reasonably viewed as unitary verbal responses. The analysis of such extended 
Units and classes of such units in terms of their consistent sequential and relational 
Properties is an analysis of behavioral structure. L 

It is paradoxical that, throughout much of the history of research on human 
Verbal learning, the learning of intraverbal relations has been emphasized to the 
relative neglect of the learning of tacts. One reason may be that it is simpler to 
define and manipulate verbal than nonverbal materials as stimuli. Another may be 
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that intraverbal relations are an important component of standard educational 
practice. For example, when a child who is mastering the multiplication table gives 
42 as the response to 6 x 7, the response is strictly intraverbal only if the correct 
response does not depend on other intervening arithmetic behavior (e.g., adding six 
sevens, counting by sixes, or counting the boxes in a six-by-seven rectangle). Before 
we can deal with the case in which the answer is derived rather than learned intra- 
verbally, we must consider the consequences of verbal behavior and the effects of 
verbal instructions. We must also examine verbal interactions, and the behavior of 
the listener and reader. These topics prepare the way for a treatment of the 
complex manipulations in generating novel verbal behavior in novel situations. 


The consequences of verbal 
behavior 


We have emphasized the stimuli that control verbal behavior, but verbal 
behavior also has consequences. As with all operant behavior, these consequences 
may affect subsequent verbal responding. In a speech episode such as a simple 
two-person conversation, each person provides an audience for the other; each 
person, in other words, sets an occasion on which verbal behavior may have 
consequences. Audiences are varied in their stimulus properties; we speak into 
telephones, write messages, or address large groups of people. Often the 
consequences for the speaker are simply a listener’s subsequent verbal behavior. It 
does not require a laboratory experiment to demonstrate that the listener’s 
response can maintain the verbal behavior of a speaker. We tend to stop speaking 
to people who do not react to what we say. To this extent, we may say that the 
listener's responses reinforce the speaker’s verbal behavior. 

Some research on the operant control of verbal behavior has been on whether 
particular verbal responses, such as “yes” or “uh-huh,” can reinforce a speaker’s 
verbal behavior (e.g., Greenspoon, 1955; Azrin, Holz, Ulrich, and Goldiamond, 
1961; Rosenfeld and Baer, 1970). Experiments have tested whether such conse- 
quences can modify the frequency of particular response classes (e.g., plural nouns) 
or the substantive content of a conversation. Results have been inconsistent, but 
failure to show that such consequences can affect the frequency of verbal classes 
does not bear on whether verbal behavior can be dealt with in terms of reinforcing 
consequences. Failure to demonstrate that a particular event serves as a reinforcer 
in a particular situation simply means that the term reinforcer is inappropriate to 
that instance. 

The consequences that may serve as reinforcers for human verbal behavior, 
like the discriminative stimuli that set the occasion for it, are subtle and varied. 
The listener’s responses may be verbal or nonverbal (e.g., answering a question, or 
coming to dinner when called), and responses that are effective in one situation or 
when made by one listener may be ineffective in other situations or when made by 
another listener. In addition, a particular consequence of verbal behavior may have 
different effects on the tendency to speak than on what is said. One difficulty with 
experiments on verbal reinforcers has sometimes been that they have attempted 
to modify the relative frequencies of different response classes with the same con- 
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Sequences that are presumed to keep the speaker talking. In any case, it is 
important to consider what maintains both the speaker’s behavior and the listener’s 
behavior. 

The mand as a class of verbal responses. One obvious kind of consequence 
for verbal behavior is illustrated when we are given something we ask for. If a child 
Says “milk” and receives a glass of milk, we might say that the milk reinforces this 
Verbal response. Verbal esponses that specify their reinforcers have been called 
mands (demands and commands, for example, specify behavior in which the 
listener must engage). The response may or may not occur in the presence of the 
reinforcer. For example, a child may mand milk even if a glass of milk is not 
Present. An analogue from animal research may be helpful. Assume that an experi- 
Mental chamber for a rat contains one lever that produces food when pressed and 
another that produces water when pressed. If the rat presses the first lever only 
when food-deprived and the second only when water-deprived, we could argue that 
the presses are, respectively, food mands and water mands. Although it would not 
be essential, we could make the analogy more convincing by arranging signs that lit 
Up for the experimenter when either lever was pressed, saying “Please give me food” 
and “Please give me water.” Except that the rat’s vocabulary is limited to two levers, 
the relations between the lever presses and their consequences are similar to those 
between verbal requests and their consequences. 

Yet the account is not satisfactory. For example, we could imagine a child 
Seeing a new toy, learning its name, and then asking for it, even though asking for 
this particular toy had never been reinforced in the past. We must conclude that, as 
4 category of human verbal behavior, manding cannot consist simply of many 
Separate response classes corresponding to each of the many potential consequences 
that could be manded (cf. Schick, 1971). Rather, manding is a single class of 
tesponses, characterized by the inclusion of the verbal response that in other 
‘lrcumstances tacts the reinforcing consequence. Within this general class, we can 
distinguish subclasses: e.g., mands that specify the delivery of particular stimuli 

“Please five me an apple”), and mands that specify the behavior of the listener 
(“Please Open the door for me”). In everyday discourse, we also distinguish mands 

Y the consequences that may follow for the listener (e.8., pleas, requests, orders, 
etc.). For example, we speak of a demand when a speaker specifies an aversive 
Consequence for a Tstenions noncompliance. Another important subclass consists 
of mands that specify the listener's verbal behavior (e.g., “Please give me your 
Name,’ cWhat is this called?”). These mands may be further subdivided according 
ke 4 Variety of features. For example, we may speak of prompts when the appro- 
Prlate verba] response is already known to the speaker (as in giving a hint when a 
8 ild is unable to solve a riddle), and probes when it is not known (as in a police 
‘terrogation), We distinguish other properties of verbal mands when we speak of 
Uestions ang instructions. K 

The role of instructions. We noted that verbal behavior can have either verbal 
or nonverbal consequences. In one way or another, the consequence is usually a 
“ange in the listener’s behavior. For example, if we tell someone who is about to 
8০ Outdoors that it is going to rain, we may change the likelihood that the person 
Wl take gp umbrella. The distinction between production and comprehension 
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of language is mainly concerned with whether we emphasize the behavior of the 
speaker or of the listener. The verbal community maintains the correspondence 
between verbal behavior and environmental events in tact relations because only if 
this correspondence is consistent will the speaker’s verbal behavior provide 
discriminative stimuli that can occasion appropriate behavior on the listener’s part 
(Risley and Hart, 1968). If the speaker’s verbal behavior is controlled by environ- 
mental events inaccessible to the listener, the speaker’s verbal behavior may become 
a potent discriminative stimulus for the listener’s behavior. For example, a listener’s 
response to the tact, “fire,” may have important consequences even if only the 
speaker has seen the conflagration. 

Control by the speaker’s verbal behavior will weaken if the speaker tacts 
unreliably. The probability of telling the truth or lying may depend on the poten- 
tial immediate consequences of each for the speaker, but the long-term 
consequences of lying may be that listeners will stop acting on the basis of that 
speaker’s verbal behavior (the tale of the boy who cried “wolf” illustrates these 
contingencies). 

The listener’s behavior may come to be strongly controlled by the major class 
of verbal behavior called instructions. A script is a set of instructions to an actor, 
and a text is a set of instructions to a reader. In both cases, the instruction specifies 
verbal behavior, i.e., what is to be said. The instructor who defines a term, for 
example, is specifying circumstances in which that term and its definition will be 
appropriate in the student’s future verbal behavior. One important characteristic 
of instruction is that it often substitutes verbal discriminative stimuli for natural 
contingencies, as when a parent tells a child, “Don’t touch the stove or you will 
burn yourself.” This property of verbal instruction has far-reaching implications. In 
general, instructions may change the listener’s behavior when the natural 
consequences by themselves may be ineffective or may be effective only slowly. If 
we invite friends to a party, for example, we give them directions rather than letting 
them search for the place on their own. 

Consider a specific case. Someone without any explicit instruction types with 
the index fingers, one letter at a time, in the method called hunt-and-peck. For the 
novice typist, this method is faster than touch typing, in which each finger has a 
particular resting position at the keyboard. The immediate consequences of the two 
typing methods favor the former. But in learning touch typing, the long-term 
consequences of following instructions to place the fingers appropriately and to 
type particular letters with particular fingers eventually outweigh the short-term 
advantages of the hunt-and-peck system. What is learned is not solely a particular 
method of typing. To follow instructions successfully, the learner must ignore 
natural consequences (in this instance, the text is at first produced more slowly by 
touch typing than by hunt-and-peck typing). Because of the practical advantages of 
instruction, the verbal community shapes the behavior of following instructions 
across a substantial range of activities throughout a substantial portion of each 
individual’s lifetime. Instructions become a class of verbal stimuli characterized by 
its effectiveness in overriding natural contingencies. 

A major achievement of human verbal behavior is that it allows behavior to 
be controlled by descriptions of contingencies, in the verbal behavior of others, 
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rather than by the contingencies themselves. But the advantages of these unique 
Properties of verbal behavior are accompanied by their special problems. A history 
of following instructions may make individuals susceptible to verbal control by 
authority figures (e.g., Milgram, 1963). In addition to the abuses that may arise 
When individuals simply follow orders, instructions may create problems in more 
subtle ways. For example, when a simple task such as pressing a telegraph key is 
established in humans by instructions instead of by shaping the key press, and key 
presses earn money according to various schedules of reinforcement, the 
performances of those given instructions are typically insensitive to the schedule 
Contingencies compared to the performances of those whose responses were shaped 
(B. A. Matthews, Shimoff, Catania, and Segvolden, 1977; cf. Reber, 1976). With 
Uninstructed responding in humans, fixed-interval schedules produce their 
characteristic scalloping and ratio schedules produce higher response rates than 
Yoked interval schedules (cf. chapter 8); neither of these schedule effects occurs 
reliably when responding is instructed. Just telling a human in an experimental 
Setting to “press the key” will produce persistent responding that is insensitive to 
‘ts consequences. 

| The problem is that we do not ordinarily want others to do what we say 
Simply because we say it. A parent or teacher who gives instructions to a child to 
establish certain kinds of behavior might prefer but cannot be confident that the 
Natural contingencies will eventually control the behavior and make instructions no 
longer necessary. For example, a reason for telling a child to put on overshoes 
fore going out to play in the snow is that the instruction may avoid the later 
AVersive consequences of the child’s returning home with cold wet feet. But if the 
child always obeys the instruction, the natural contingencies will never have an 
Opportunity to act on the child’s behavior, whereas if the child disobeys the instruc- 
tions, the aversive consequences that follow from snow and unprotected feet will 
enhance the control by instructions on future occasions. Thus, if we try to teach by 
telling others what to do, we may simply reduce the likelihood that they will learn 
tom the consequences of their own behavior. There is no easy solution to this 
dilemma, We must always choose between the immediacy and convenience of 
Verbal instructions and their longer-term effects on the learner's sensitivity to the 
Consequences of behavior. 

The listener’s or reader’s behavior. Given that the speaker’s verbal behavior 
Provides discriminative stimuli for the listener, the listener’s behavior can be 
characterized simply as the behavior occasioned by these verbal stimuli. The 
Steners responses to verbal stimuli can be as varied as the responses to any other 

inds of environmental events. Many of the possible verbal responses have already 
een considered, in the classification of echoic responses, intraverbal responses, and 
50 0f.. Some. nonverbal responses occasioned by verbal stimuli are also obvious 
enough that they do not require special consideration. Whether the critical stimulus 
S a red light, a traffic officer's outstretched hand, the utterance “stop,” Or a tree 
i across the road, the driver’s response of stepping on the brakes Hustrates 
i Phenomenon of stimulus control. As we move from watching an actual incident 
Watching the incident acted in a play or 2 film and then to reading the script for 

1e actual incident and then to reading a description of the incident in a story, the 
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common feature that holds these cases together must lie in consistencies of stimulus 
control over verbal and nonverbal behavior. 

As with nonverbal stimuli, not all responses to verbal stimuli are operant. 
For example, if a spoken word is paired with a stimulus that elicits autonomic 
responses (e.g., electric shock), the word may come itself to elicit these responses. 
The phenomenon, sometimes called semantic conditioning (e.g., Riess, 1946), is 
a verbal equivalent to classical or respondent conditioning of nonverbal responses. 
Responding generated by these procedures generalizes across semantic as well as 
Phonological dimensions of verbal stimuli. For example, if electric shock is paired 
with a farm word, such as barn, the conditioned galvanic skin response is more 
likely to generalize to other farm words, such as cow, than to words that simply 
have some letters in common with the original word, such as burn. 

Yet if we say that a listener has understood something, it seems unlikely that 
we can provide an adequate account of the listener’s response simply by appealing 
to the pairing of words with words (as in the giving of definitions) or of words with 
events (as in the teaching of tacts). The problem of meaning must reside at least in 
part in properties of the listener’s responses to verbal stimuli. One critical property 
may be the correspondence between responses occasioned by a verbal stimulus and 
responses occasioned by the nonverbal events that the verbal stimulus ordinarily 
tacts. Many empirical studies of verbal behavior are primarily on ways in which 
verbal responses that tact related environmental events vary together in the verbal 
behavior of a speaker or have common effects on a listener. Whatever else is 
involved in the listener’s behavior, it is reasonable to assume that the response to a 
tact shares some properties with the response to what is tacted. 

An illustration comes from research on language in the chimpanzee (Premack, 
1970), in which the chimpanzee’s verbal responses, instead of speech, consisted of 
Placing plastic chips of various shapes and colors on a magnetic board. Like sign 
language (cf. R. A. Gardner and Gardner, 1969), this language circumvents the 
limitations of the chimpanzee’s vocal apparatus; in addition, its advantage over such 
transient verbal modes as speech or gestures is that it places fewer demands on 
memory capacity in establishing verbal behavior. 

Some illustrations of the kinds of verbal stimuli and responses that a 
chimpanzee called Sarah could deal with are illustrated in Figure 10-3. Sarah was 
capable of varied verbal behavior, including not only manding and tacting but also 
the relational verbal responses we will later refer to as autoclitic. Here we consider 
only one example of the chimpanzee’s verbal behavior. In a matching procedure, 
Sarah was taught to select from a pair of alternatives the property that matched a 
sample object. Given a round red apple, for example, she learned to select a red 
stimulus from a red-versus-green pair, and a round stimulus from a round-versus- 
rectangular pair. Sarah also made the same selections when the sample was the blue 
plastic chip that was the name for an apple. In other words, her response to the 
verbal stimulus, apple, was the same as her response to the nonverbal stimulus, an 
apple, that it tacted; the response was not controlled by the physical properties of 
the verbal stimulus itself (such transfer from a nonverbal stimulus to the 
corresponding tact may in fact be an appropriate criterion for calling responding 
verbal). 


No or 


Not 
Sarah 
Name 
of 
Take 


Figure 10-3. Some examples of verbal materials used in teaching language to the 
chimpanzee, Sarah. Each word consisted of a colored piece of plastic that could 
be placed on a magnetic board. Statements were formed by arranging the words 
Vertically, as in the sequence on the left, Sarah take apple. The middle column 
Shows abstract words: a negative, no or not; name of, eventually used to add 


Answer would be a choice of either the yes chip or the no chip. The bottom right 
Shows the question, name of apple?, where the object on the right is an actual 
apple; Sarah’s answer would involve replacing the interrogative chip with the 
triangular chip that is the name for apple. (From Premack, 1970, Figures 1-4.) 


Human language also includes examples in which the properties that a verbal 
tesponse ordinarily tacts override the properties of the verbal response itself. For 
eXample, if words are printed in different colors, it is difficult to tact these colors 
tapidly if the words themselves are incompatible color-names (e.g., the word red 
Printed in green: Stroop, 1935); we read words, and do not ordinarily attend to 
Physical Properties such as the color in which they are printed. The relation 
between stimuli and the verbal responses they occasion clarifies some logical 
Paradoxes of the language of reference. For example, if a self-descriptive word is 
‘Omologica]l (e.g., the homological word short is short), whereas a nonself-descrip- 
live Word is heterological (e.g., the heterological word long is not long), what then 
S the word heterological? That heterological cannot be homological without being 
eterological is a paradox of logic, but it is not a paradox of verbal behavior; a 
Verbal response can tact a verbal stimulus, but it cannot tact itself. 
In dealing with the formal relations, We saw that the symmetry and reversi- 
Of stimulus and response relations increase the likelihood of speaking in terms 
Of words rather than in terms of specific vocal or written modalities. Similar 
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correspondences exist in the relation between tacts and environmental events; these 
correspondences may be important when we speak of meaning independently of 
whether a word functions as a stimulus or a response. Consider, for example, rain 
as a stimulus, the word rain, and responses occasioned by rain, such as using an 
umbrella or putting on a raincoat. Both rain and the word rain, as stimuli, may 
occasion either a verbal response, the word rain, or a nonverbal response 
appropriate to rain. We may look out a window, see rain, and pick up an umbrella 
on the way out of the house, but the latter response might also be occasioned by a 
verbal stimulus, as when a weather report predicts rain. On seeing rain, we may 
speak of the rain to someone else, who may than take an umbrella, but this 
speaking might also be occasioned by the weather report. The relations are 
summarized in Figure 10-4. 


S onverbal Ronverbal 
(rain) (take umbrella) 
Serbal Ryerbat 
(the word rain) (the word rain) 


Figure 10-4. Relations among verbal and nonverbal responses occasioned by verbal 
and nonverbal stimuli, where S = stimulus and R = response. Such consistencies 
are essential to understanding and meaning. (Cf. Figure 10-1.) 


When we say that the listener, on repeating what the speaker had said, under- 
stood the speaker, we are ordinarily not satisfied to call the relation between rain 
as a verbal stimulus and rain as a verbal response echoic. Presumably then we judge 
understanding or meaning not by any single relation between stimuli and responses, 
but rather by the integrity of the sorts of interrelations illustrated in Figure 10-4. 
We may say that someone understands something that has been said when the 
individual can repeat what has been said not because someone else said it but for 
the same reasons that the other individual said it. Such behavior implies the 
consistent relations among verbal and nonverbal responses in Figure 10-4, and 
therefore must be central to the concepts of meaning and understanding. 

In any case, these examples show that the response to a verbal stimulus may 
have properties in common with responses to the stimulus that is tacted. We know 
only a very little about the properties of these responses, but they are undoubtedly 
aspects of the phenomena we speak of in terms of meaning, reference, and 
semantics. Such behavior can be established by verbal instruction, as when we teach 
the names of things; the nature of instruction must therefore stand as a central issue 
in the analysis of verbal behavior. 


Autoctlitic processes: Verbal 
behavior conditional upon 
verbal behavior 


We already noted that verbal behavior, like any other event, can be tacted. 
The functional relations that can be observed when nonverbal stimuli set the 
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Occasion for verbal responses can also be observed with verbal stimuli. No new 
kinds of relations are involved, but the complexities generated when verbal behavior 
is built upon other verbal behavior need special comment. Verbal behavior that 
depends upon other verbal behavior for its occurrence and that modifies the effects 
Of other verbal behavior is called autoclitic. It includes words that modify the 
effect of other words on the listener, and the complex orderings and arrangements 
that we call grammatical structure. A brief account cannot do justice to the richness 
Of autoclitic processes; what follows will survey only a few representative examples. 

Descriptive autoclitics. Many verbal responses tact the conditions under 
Which other verbal behavior is emitted, and thereby modify the way in which the 
listener responds to what we say. In “] doubt the coffee is ready,” “I think the 
Coffee is ready,” and “I am sure the coffee is ready,” the speaker tacts some 
Property of the tendency to say, “The coffee is ready,” and the probable relation 
between that response and the condition of the coffee. The effect is to modify the 
likelihood that the listener will act upon the verbal response. The relation of I 
doubt, I think, and I am sure to the remainder of each sentence is analogous to that 
between the stimuli of a conditional discrimination. In “1 recall that it rained 
Yesterday,” “] read that it rained yesterday » and “I heard that it rained yesterday,” 


the speaker specifies the source of the verbal response, “Jt rained yesterday.” Other 


descriptive autoclitics tact the speaker's reaction to current verbal behavior, as in 


Tam sorry to report...” or “Tam pleased to Say. . . ন | 
Qualifying and quantifying autoclitics. We do not tact everything we See, 
and, conversely, we sometimes respond as if tacting when the stimulus is absent. 
The qualifying autodlitio accompanying such verbal behavior is typically some form 
Of the verbal response, no. For example, we do not continuously say “The coffee is 
vot ready” under conditions of unready coffee. Instead, this verbal response is 
likely to occur when circumstances set the occasion for saying “The coffee is 
ready” (e.g., the smell of coffee, or the question, “Is the coffee ready?”), but the 
conditions do not exist under which this verbal response can be uttered as a tact 
(ie., the coffee is in fact not ready). | 
Assertion, like negation, is also autoclitic, but the verb is serves many func- 
টী Sometimes it specifies that the verbal response it accompanies is a tact (“This 
is a book”), sometimes it prescribes equivalence between verbal responses A 
human is a featherless biped”), and sometimes, as already considered, it specifies 
temporal properties (“It is cold now”). The particular function of is often depends 
On other verbal responses or, in other words, on context. Not only does it function 
AS a conditional stimulus with respect to the effect of other verbal behavior, but its 
“nction may in turn be conditional on other verbal behavior. Cl 
The effects of autoclitics may be quantitative as well as qualitative. Examples 
Are few, some, and many, and the plural forms of nouns and verbs. The effect of 
Often in “This text is after misunderstood” may be paraphrased as a statement that 
he tact, “This text is misunderstood,” may often be uttered appropriately. As we 
Shall see in the treatment of language structure, We have no independent nonverbal 
Means for characterizing these relations, but paraphrase is often appropriate 
ia it makes explicit the conditional relations among the components of the 
terance. This is most obvious when the autoclitics are mands specifying the 
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listener’s verbal behavior (e.g., vice versa is conditional upon a preceding verbal 
response, and may be interpreted as a mand specifying that the listener generate a 
new verbal response reversing the order of particular components of the original 
verbal response). 

Relational autoclitics. Some verbal responses can specify events only through 
their relations to other verbal responses. For example, words such as above, before, 
and of are not simply tacts of particular environmental circumstances. They occur 
in combination with other verbal responses, and depend on these other verbal 
responses for their effects. Because agreements of case and tense are also 
conditional upon properties of events tacted, these too can be treated as autoclitics. 
A structural unit of verbal behavior, such as a sentence in passive voice and past 
tense, can be regarded as a complex of tacts of various relational and temporal 
properties of a set of events. 

An analogy from the behavior of a pigeon may be helpful. Suppose we add 
some new dimensions of stimulus control to the tacting of the earlier examples. We 
now project a circle or a square along with the colors in the window. We maintain 
left-key pecks during red, middle-key pecks during blue, and right-key pecks during 
green. But now we make reinforcement contingent upon fast pecking if a circle is 
in the window, and slow pecking if a square is there. Next, we vary the size of the 
circles and squares, and we make reinforcement contingent upon the force of the 
pecks: strong pecks when the figure is large, and weak pecks when the figure is 
small. If the pigeon’s pecks come under the combined control of these stimulus 
dimensions, we will observe fast strong pecking on the middle key in the presence 
of a large circle on blue, and slow weak pecking on the right key in the presence 
of a small square on green. Just as a given set of events may occasion the separate 
words of a sentence in a particular grammatical order, the stimuli in the window 
may occasion responding that varies with their several properties. Incidental 
features of responding, such as the topography of successive pecks, may vary 
consistently with particular combinations of properties, just as stress patterns and 
other features of human verbal behavior may vary consistently across different 
grammatical forms. 

We cannot say how difficult it would be to establish such a performance in 
the pigeon, nor whether the pigeon’s response would be appropriate to a novel 
combination of stimulus properties (e.g., whether, if we had trained with respect to 
shape and size only during blue and green, the pigeon would peck with appropriate 
rate and force on the left key during red). The point is not to explore the pigeon’s 
capacity for verbal behavior; rather, it is to illustrate the nature of the multiple 
dimensions of stimulus control over varied response properties in complex verbal 
behavior. To the extent that environmental properties occasion classes of verbal 
responses that are invariant even when they occur ip combination with other 
response classes, we may treat such classes as verbal units (e.g., as when the present- 
tense active-voice sentence structure remains invariant across a variety of different 
tacted events). Such units may have complex structures, but it is only because such 
units and their components can be combined in novel ways that novel verbal 
behavior controlled by novel environmental conditions can be generated; things 
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Previously unknown may come to be tacted only on the basis of verbal behavior 
that is available with respect to things known. 


Interaction between verbal 
behavior and environment 


Our own verbal behavior may set the occasion for subsequent behavior. For 
example, we may act on a reminder we wrote for ourselves at an earlier time. 
Sometimes the subsequent behavior is verbal: we reword sentences, draw 
Conclusions, derive solutions. These manipulations are of special interest when, as 
in problem solving, they lead to behavior that was unavailable earlier. The power 
Of verbal behavior resides in how it provides discriminative stimuli setting the 
Occasion for responding that may have important environmental consequences. 

Consider an example from mathematics, a convenient illustration because 
the notation of mathematics is an exact prescription of the verbal responses appro- 
Priate to particular verbal stimuli. Understanding of addition or multiplication is 
Judged by the number of ways in which an individual can respond to relevant verbal 
Stimuli, An individual should be able to define these operations, to discriminate 
between cases to which they can apply and cases to which they cannot, to give 
Answers to specific problems, and to derive each from simpler counting operations. 
Such behavior is verbal, and is necessary and sufficient for the statement that 
addition and multiplication are understood by that individual. 
্ The learning of arithmetic involves intraverbal and autoclitic processes, but 
ts particular advantages come when it is combined with tacting of the numerosity 
Ef environmental objects or events (cf. Ferster and Hammer, 1966). A child may 
calculate the number of objects in a rectangular array by multiplying the number 
Of rows by the number of columns, Or simply by counting all of the objects. Either 
Operation is verbal, but the outcome is a verbal response that now may serve as a 
tact of the quantity of objects in the array The structure of arithmetic corresponds 
to the structure of the environment so that new verbal responses occasioned by 
Arithmetic behavior may then function effectively as tacts. This property of arith- 
Metic behavior may be a property of verbal behavior in general. The components 
of thinking, deduction, inference, and other processes that have been called 
Cognitive Tay not be aS well defined, and may often be hs accessible to an 
Observer than the explicitly structured behavior of arithmetic. Nevertheless, they 
Must share properties with the structure of the environment, because the environ- 


ens s 
ent is that of which we speak. 


Psycholinguistics 


ELEVEN 


Three primary terms in the analysis of language have been semantics, the 
study of meaning, syntax, the study of grammatical structure, and pragmatics, 
the study of the functions of language. Semantics can be traced to the Greek 
Sema, a sign or thing seen, syntax to the Greek taxis, arrangement, and 
pragmatics to the Greek prassein, to effect or do. Pragmatics is a relative of 
bractice. Grammar, through the Greek graphein, to scratch or write, and 
8ramma, a picture or a writing, is closely related to graph, program, and 
topography. 


In this chapter, we turn from the functions of verbal behavior to its structure. 
nBuage is organized. Words are ordered in sentences, and sentences are ordered 
‘1 texts. We also speak of how words are related to each other, as when we say that 
two words are similar or dissimilar in meaning. Psycholinguistics calls these topics 
ntax and semantics. The problem of syntax is how the words in a language are 
Organized in sentences; syntax is therefore concerned with the grammatical struc- 
ture of language. The grammatical ordering of words can be regarded as an 
tutoclitic process. But in this chapter we will approach language from the structural 
Point of view of psycholinguistics, noting the relation of this vocabulary to the 
“nctiong] account of chapter 10 when appropriate. | 
After syntax, we will consider semantics, Which deals with the problem of 
Meaning When we introduced verbal behavior, we questioned the traditional 
Concepts of meaning and reference. In this chapter, we will see what can be said 
about oe traditional concepts. In effect, we will ask what it is that We tact 
When We say that particular words are related in meaning. Psycholinguistics deals 
With a Speaker’s vocabulary in terms of the speaker’s lexicon, the dictionary of 
a available in the speaker’s verbal behavior. We must therefore examine the 
Cture of the lexicon. These topics, often related to each other, will pave the way 
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for our treatment of verbal learning and memory in later chapters. (Psycholinguis- 
tics has sometimes included a third topic, pragmatics, the uses of language; this 
corresponds most closely to the functions Of language already considered in chapter 
10.) 


SYNTAX: THE GRAMMATICAL 
STRUCTURE OF LANGUAGE 


In what ways can we describe how the words of a language are organized in 
sentences? We can count or transcribe or classify words with some consistency. We 
then can analyze sentences as orderings or arrangements of these discrete units. To 
begin, we know that we can discriminate grammatical from ungrammatical 
examples of the sentences of a language. Among grammatical sentences, we can 
classify or tact various categories of sentences, such as active voice, passive voice, 
past tense, and so on. But how do we define the dimensions along which we make 
these distinctions? We are concerned with describing the grammatical structure of 
language. This is a problem of stimulus control: sentences are stimuli, and their 
properties set the occasion for particular verbal responses, such as naming types of 
sentences. When we say that a sentence is declarative (a statement) or interrogative 
(a question), we are tacting something about the sentence. The grammar of 
sentences is primarily a structural issue. Before we are willing to say whether a 
sentence is grammatical, we do not ordinarily ask about the circumstances in which 
it was uttered or the effects it had. 

We could begin by trying to list all grammatical sentences. But that task 
would be neither feasible nor relevant. Most of what we say or write is novel. 
Sentences are only occasionally repeated, and the possible lengths and varieties of 
sentences are unlimited. Thus, our list would be infinitely long. Even if we could 
produce one we would not have solved our problem, because we still would not 
know what made a given sentence grammatical. Instead of lists, we need an exhaus- 
tive description of the kinds of grammatical sentences. Although the number of 
possible sentences is unlimited, the number of kinds of sentences is presumably 
finite. In the language of Psycholinguistics, we might speak of writing a grammar 
with a finite number of rules. 

Three types of accounts describe the Structural regularities of grammatical 
sentences (Chomsky. 1956; Chomsky and Miller, 1963; Catania, 1972). The first 
is restricted to sequential dependencies between successive words or word 
sequences, corresponding to the chaining of responses considered in chapter 6. 
The second describes grammatical structures in terms of the constituents or com- 
ponents of those structures. The third examines the transformations that show how 
a given grammatical structure with a particular arrangement of constituents is 
related to other grammatical structures. For example, consider the saying, “He who 
hesitates is lost.” We would be examining Sequential dependencies if we presented 
“He who hesitates” as a verbal stimulus and determined the frequency with which 
“is lost” is given as a response. If we instead studied how parts of the sentence are 
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named subject and predicate or pronouns and verbs, we would be dealing with 
constituents. And if we then described the relation of this saying to such 
Paraphrases as “He is lost who hesitates,” we would be dealing with grammatical 
transformation. We will find that each of these three structural features may 
Contribute to our judgments of grammaticality. They are not mutually exclusive; 
none is adequate, by itself, to characterize the structural properties of verbal 
behavior. 


Sequential analyses 


We treated some properties of sequential analysis in the context of intraverbal 
behavior (chapter 10). There we considered the verbal responses to verbal stimuli 
called word associations. Word associations can be extended to verbal stimuli 
consisting of several words instead of a single word. Such sequences are the basis for 
Constructing different orders of approximation to English. For example, suppose 
One is asked to complete a sentence given its first three words. The first of the three 
Words is then dropped. and the two remaining words plus the next word are 
Presented to a second participant, who now generates a sentence that continues 
those three words. Again the first word is dropped, and the next three remaining 
Words are presented to still another participant, and so on. An example of a text 


Produced in this way is: 


ulletin he brought home a turkey will die 


the fi i d on the b 
irst list was poste sleet are destructive and playful students 


On my rug is deep with snow and 
always (G. A. Miller and Selfridge, 1950) 


This is called a fourth-order approximation to English, because at each point a 
Participant sees only the last three words of the sequence and must add the new 
fourth word. According to this nomenclature, a zero-order approximation to 
English has words chosen randomly, and a first-order approximation has words 
Chosen randomly but in proportion to their frequencies in the language. The 
following compares samples from several orders of approximation to a sample of 
English text (from Miller and Selfridge, 1950): 


Zero-order: byway consequence handsomely financier bent flux 
cavalry swiftness weather-beaten extent; | 
First-order: abilities with that beside I for waltz you the sewing; 
Secord order: was he went to the newspaper is in deep and; 
Third-order: tall and thin boy is a biped is the beat; | 
Fifth-order: they saw the play Saturday and sat down beside him; 
Text: the history of California is largely that of a railroad. 


Miller and Selfridge found that the more closely a text approximated English, 
the more accurately it was recalled. The critical point is that if we simply add new 
Words on to the end of an existing sequence, in producing various approximations 
টি English, we cannot guarantee that we will generate grammatical sequences. 
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Common sequences of words can occur together in ungrammatical ways (“Haste 
makes waste not want not”) whereas rare sequences of words may be grammatical 
(e.g., “Sleeping green ideas dream furiously,” “‘Eschewing obfuscation mitigates 
prolix propensities”). We can tact the grammaticality of novel sentences, but such 
tacts cannot be based on whether the word sequences were likely or unlikely. It 
will not help us much to examine sequential dependencies by recording the relative 
frequencies with which particular word sequences occur in some sample of 
sentences. In fact, we might judge some sequences as ungrammatical even though 
they are probable, as in the saying, “The wages of sin is death.” 

One reason that sequential analyses cannot adequately describe grammatical 
structure is that they fail to include relations among words independently of the 
separation of the words. Consider, for example, the relation between chimpanzee 
and used in the sentence “The chimpanzee used sign language.” The same relation 
between these two words exists when the sentence is expanded: “The chimpanzee 
that was taught by the psychologist used sign language.” Yet we cannot represent 
that relation with a sequential analysis. The probability with which used will follow 
chimpanzee in any sample of sentences is not likely to be the same as the 
probability that used will follow psychologist. Sequential probabilities when two 
words occur in immediate succession will necessarily differ from the probabilities 
when other words intervene. 

Even if we could resolve that problem, we would encounter other difficulties. 
For example, sequential analyses would not help us to deal with ambiguous 
sentences. Consider the sentence, “Running experiments should be encouraged.” 
This might be read as recommending more support for research (“Encourage the 
running of experiments”) or more research on exercise (“Encourage experiments on 
running”). The grammatical structure depends on which interpretation is appro- 
priate, but we cannot base our interpretation on a sequential analysis, because the 
Same words occur in succession whichever way we read the sentence. The two 
structures therefore cannot be distinguished by word sequences. The circumstances 
in which the sentence was generated might be useful. For example, it might make a 
difference if the sentence were uttered in a psychology laboratory rather than at a 
track meet. But this is a functional distinction and not a structural one. In any case, 
the point remains that a sequential analysis is inadequate for dealing with sentence 
structure. This does not mean that sequential processes cannot or do not operate in 
verbal behavior. What we say at one moment must sometimes influence what we 
say in the next. We have called such relations intraverbal. We learn verbal sequences 
along with, or even independently of, grammatical structure (e.g., as when a child 
learns to recite the alphabet). Although sequential dependencies enter into verbal 
behavior, they cannot define grammatical structure. 


Constituents and phrase 
structure 


One aspect of grammatical structure derives from the constituents or 
components of particular sentences or classes of sentences. We determine the 


Psycholinguistics 
259 


EEE of a sentence by noting how its parts are related to each other. 
COTE Ee sentence A word to the wise is sufficient.” We can name its 
EE nts: word is a noun, is is a verb, sufficient is an adjective, and so on. The 
For ee not defined simply by how close particular words are to each other. 
onal p 5 word is is tied grammatically more to word than to wise, even 
Een oser to the latter. The relations among words in sentences have been 
Ve in various ways (es., Wundt, 1900; Chomsky and Miller, 1963). Three 
Ht epresentation, sometimes called phrase-structure diagrams, are illustrated 
in Figure 11-1. f 


lL. BOX DIAGRAM 


k to the wise is sufficient 
Article Preposition Article Noun Verb Adjective 


Noun Phrase 


Noun Phrase 


Verb Phrase 


Noun Phrase 


Sentence 
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Sentence 


Noun Phrase Verb Phrase 


Noun Phrase 


Adjective Noun Verb 
Adjective Noun Phrase 
Article Adjective Noun 
One picture is worth { thousand words 


Hl. BRACKETING DIAGRAM 
( La (picture) J [to) (the) [wise] ৷] H Fi)Cceauneent]}) 


Fi 
5) e 11-1. Three methods for representing the constituent structure of sentences. 
dvi es how a sentence can be analyzed into structural units ranging from 
the br. ual words to phrases. Note that the sentences in the box diagram (D and 
the tr ing diagram (III) have equivalent structures. In recent psy cholinguistics, 
e টি diagram (II) has probably been the most common form of representation. 
verb Re where the branches of the tree come together are called nodes; thus, 
rase in the example is a node at which verb and noun phrase come together. 


We could go on now to discuss properties of noun phrases and verb phrases 
A On. But the point is not to argue for one or another kind of description of 
Its of a particular sentence. Rather, it is to illustrate the nature of an analysis 


and 5 
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of constituents. The illustrations in Figure 11-1 mainly involve the naming or 
tacting of constituents. How do we decide whether a word should be called one 
type of constituent or another? We distinguish among constituents not by the 
words alone, but by their relations to other words in the sentence. We cannot 
classify constituents unless we also identify the structure of the sentence. 

Consider the following example: “He whose laughs last laughs last.” The 
word laughs appears twice, first as a plural noun and then as a verb; the word last 
also appears twice, first as a verb and then as an adverb. What we call laughs or last 
therefore depends on the relation of each word to other words in the sentence. 
This poses a problem. It appears that we cannot identify the structure of a sentence 
without classifying its constituents, but this last example demonstrates that neither 
can we classify the constituents without identifying its structure. How then can we 
ever describe the structure of a sentence? To resolve this problem, we must identify 
invariances of sentence structure. These invariances are not in particular words or 
word sequences, but rather in various coordinations among words. We can identify 
words of a sentence as members of particular classes (as particular kinds of 
constituents) because we have learned typical structures in our language (e.g., 
patterns of agreement between singular and plural nouns and verbs). 

We may be tempted to look to the environment as a basis for the structure 
of sentences. But even if we show that certain kinds of sentences are likely to be 
uttered in certain kinds of circumstances, we cannot appeal to such circumstances 
as a basis for our naming constituents. This point can be demonstrated simply. We 
can name the constituents of some sentences even when the sentence is made up 
partly of nonsense words. For example, compare the sentences “He who guffs 
merts” and “She merts his guffs.” Even though guffs and merts are not standard 
words in our language, we would call each a verb in the first sentence whereas we 
would call merts a verb and guffs a plural noun in the second sentence. 

We must conclude that our grammatical classification of words does not 
depend on the environmental events of which we speak. The classification depends 
instead on the sentence structures within which words appear. Verbs, for example, 
are not defined as the class of words occasioned by activities; they are defined in 
terms of conjugation or other grammatical properties. Compare the activity word, 
running, in the sentences “The child is running” and “The child’s running is fun to 
watch;” only the first instance is classified as a verb. The case is even more obvious 
when the activity word changes form with a change in grammatical structure, as in 
“The chess-player moved the knight” versus “The move surprised the other player,” 
or “The jury examined the evidence” versus “Their examination was thorough.” 

It would be incorrect to assume that environmental determinants are never 
part of our judgments of grammatical structure. For example, in treating sequential 
dependencies we considered ambiguous sentences. An instance is “Time flies.” In 
most circumstances, as when this sentence is uttered at a reunion of old friends, we 
call time a noun and flies a verb. But if the sentence were uttered in a biology 
laboratory as an instruction to record how long certain insects take to get from one 
place to another, we might conclude instead that time is a verb and flies is a plural 
noun. 
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We are left with a paradox. On the one hand there are sentences with struc- 
tures that allow us to name the constituents without knowing the circumstances 
in which they were uttered (e.g., “He who guffs merts”); on the other hand there 
are sentences with structures that do not allow us to name the constituents unless 
Wwe know the circumstances in which they were uttered (e.g., “Time flies”). Thus, 
any account of grammar that is either exclusively structural or exclusively 
functional will necessarily be incomplete. 

We must understand what we have accomplished so far. We have examined 
Some ways of talking about the structure of sentences, but we have not explained 
how we discriminate among nouns, verbs, and other parts of speech. We have not 
even discussed the dimensions upon which such discriminations might be based. 
We have merely shown that we can tact the constituents of sentences with some 
consistency (cf. Danks and Glucksberg, 1970; Gleitman and Gleitman, 1970, p. 10). 
So far, we have not identified any way independent of this tacting to specify the 
Properties of constituents and their relations to each other. But this does not 
Prevent us from speaking in the language of stimulus control. We faced the same 
Problem in the treatment of abstraction. Properties of events set the occasions on 
Which we respond with particular names, and verbal behavior itself can be included 
Among such events. b 

In any case, and perhaps more important, our response to a sentence is 
typically not a matter of naming its constituents or drawing 4 diagram of its phrase 
Structure. If someone asks us a question, We do not have to say which words are 
Nouns and which verbs before we can answer. Children learn to speak and to 
Understand sentences long before they are formally taught grammar and the parts 
Of speech in school. Our analysis of grammatical structure therefore must be based 
on Something more than our capacity to name constituents or to diagram structures. 

e must demonstrate the relation between these sentence properties and the 
Speaker’s or the listener’s behavior. Experiments that seek to demonstrate such 
telations are said to be concerned with the “psychological reality” of various 
Te of language (e.g., Fodor and Bever, 1965; Schane, Tranel, and Lane, 

Figure 11-2 provides an example (Johnson, 1965). People were asked to 
Memorize sentences with different types of phrase structures. When they were 
ater asked to recall the sentences, the probability of an error Was greatest at the 
major grammatical transitions. FOr example, errors were highly likely at the break 

Ctween the noun phrase and the verb phrase in both types of sentences, even 
ugh this break occurred at word transition 3 in sentences of type I and at word 
tansition 5 in sentences of type U. The matching of the types of sentences in 
umber of words and other properties demonstrated that grammatical structure 
and not simply numerical position or some other nongrammatical feature was the 


asis for the pattern of errors (cf. Jarvella, 1971, for a similar analysis at the level 


9 ৰথ, 
Phrase and d transitions). 
sentence rather than word tr y 
1 In other experiments (Fodor and Bever, 1965; Bever, Lackner, and Kirk, 
8) listeners wearing earphones heard a sentence in one ear and a click in the 


other, They were then asked to locate where in the sentence the click occurred. 


Johnson (1965) 
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Figure 11-2. Probability of an error in sentence recall as a function of word transi- 
tions in two different sentence types. Sentences of type I are illustrated by: (The! 
tall? boy) (saved* the dyingS woman). In this type of sentence, error probability 
was highest at the break between the noun phrase and the verb phrase (transition 
3). Sentences of type II are illustrated by: [(The’ house)? (across the‘ street )] 
5 [burnedS down]. In this type of sentence, error probability was highest at the 
break within the noun phrase (transition 2), and the next highest at the break 
between the noun phrase and the verb Phrase (transition 5). In other words, errors 
were more likely at those transitions corresponding to major breaks in phrase struc- 
ture. (Adapted from Johnson, 1965, Table 2.) 


When their judgments of click location were Wrong, the clicks tended to be 
displaced in the direction of major sentence boundaries. This displacement did not 
depend on such acoustic properties as locations of pauses or speech intensity 
(Garrett, Bever, and Fodor, 1966). For example, the same tape recording was used 
to produce all but the opening words in pairs of sentences such as the following: 
(In her hope of marrying)X(AnnaYwas surely impractical) and (Your hope of 
marryingX Anna) Y (was surely impractical). The parts of the recordings at which 
the major sentence boundaries occurred were acoustically identical. Nevertheless, 
a click in the middle of the word Anna was more likely to be heard displaced 
toward X than toward Y in the first sentence and toward Y than toward X in the 
second. These experiments have been controversial (e.g., Dalrymple-Alford, 1976; 
Glucksberg and Danks, 1975, chapter 4; Holmes and Forster, 1972; Reber, 1973; 
Reber and Anderson, 1970), on the basis of such factors as possible biases in the 
listeners’ reports of sentence locations. Whether the results are interpreted as dis- 
placements in the heard locations of clicks or as Systematic errors in the listeners’ 
reports of those locations, they remain as demonstrations of the effects of sentence 
structure on the listener’s behavior. This is the sense in which we may be justified 
in saying that such structures have psychological reality. 
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Transformations 


In speaking about sentences, we may note that some sentences say different 
things while others say the same thing in different ways. In Judging sentences this 
Way, we are not discriminating among particular sentences or constituents. Instead, 
We are discriminating among certain relations between sentences. We cannot 
describe such relations solely in terms of constituents, because sentences with the 
Same constituents in different orders may differ in grammatical structure. For 
example, consider the following three sentences: “He who hesitates laughs last, 
“He laughs last who hesitates,” and “He who laughs last hesitates. The first two 
Sentences have something in common that neither has in common with the last. 
We woulq say that the first two sentences “mean the same thing. Although the 
Word orders differ in the first two sentences, the structural relations among their 
Constituents are the same (e.g., who is similarly related to hesitates in both 
Sentences). We therefore call the second sentence a grammatical transformation of 
the first, 

Transformations describe changes in the structure of sentences that preserve 
certain relations among the constituents. When we transform a sentence from active 
to passive voice (e.g., “The rat pressed the lever” and “The lever was pressed by the 
rat”), we preserve the subject-object relations among the nouns and the verb. When 


nother, we are simply discriminating some of the structural features that the 
Sentences Share. f 
Once structural equivalences among sentences have been described by trans- 
formations, kinds of sentence structures may be distinguished by how they can be 
ansformed. For example, consider the sentences, “He is hard to understand and 
e is last to understand.” These sentences differ only in the adjectives, hard and 
“St. They seem similar in structure, but we can demonstrate a difference by 
Xamining how each may be transformed. We can transform the first sentence to 
© understand him is hard” but not to “He understands hard. Conversely, we 
can change the second sentence to “He understands last” but not to To under- 
Stang him is Jast.” Although the original sentences appear similar, we can distin- 
Buish their structures by their different relations to other sentences. In the language 
Psycholinguistics, the sentences are called similar in surface structure (the 
Particular order of constituents) but different in deep structure (the underlying 
Sructuray features that distinguish them). 
We sometimes judge how well a phrase or a sentence has been understood by 
Whether it can be paraphrased appropriately (cf. Gleitman and Gleitman, 1970). 
Or example, an answer is often related by grammatical transformation to the 
Question that is asked. If a question like “Who bit the dog?” was answered by “The 
OE didn’t bite anybody” instead of by “Nobody bit the dog,” we might assume 
€ question had been misunderstood. Again, we cannot appeal to semantics for our 
Vassification. The classification of transformations does not specify the conditions 
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Waste will be made by haste. -FPv FPvO - Will waste be made by haste? 
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Haste will make waste. -F ——— FO - Will haste make waste? 


Na: \ 
Waste will not be made by haste. -FPVN— FPVNO - Won't waste be made by haste? 
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Haste will not make waste. ~- FN FNO -| Won't haste make waste? 


Waste is made by haste. |- Pv — PVO- Is waste made by haste? 
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Haste makes waste. - KERNEL — Q- Does haste make waste? 


Waste is not made by haste. - PWN PvNO - Isn't waste made by haste? 
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Haste does not make waste. - N NO -| Doesn't haste make waste? 


Waste was made by haste. |-PaPv — PaPvOQ - Was waste made by haste? 


ZN 


Haste made waste. ~ Pa ———— PaQ - Did haste make waste? 


VN 
Waste was not made by haste. - PaPvN— PaPvNO- Wasn't waste made by haste? 
0 
PaNO- Didn't haste make waste? 


Le 
Haste did not make waste. ~ PaN 


KERNEL  - Base sentence (present tense, active voice, positive, and declarative) 
F - Future Tense 

N - Negative 

Pa - Past Tense 

Pv - Passive Voice 

[e! - Question 


Figure 11-3. Some transformations of the kernel sentence, “Haste makes waste.” 
The middle block shows transformations from declarative to interrogative (state- 
ment to question, left-right), from active voice to passive voice (front-back), and 
from positive to negative (top-bottom). For each sentence type, the transformation 
to future tense is shown in the upper block and to past tense in the lower block. 
The closeness of the relation between two sentence types depends on the number 
of transformations that separate them (e.g., the future “Haste will make waste” 
is closer to the kernel “Haste makes waste” than to the past passive question “Was 
waste made by haste?”) 


under which any particular transformation occurs. Instead, it shows how different 
sentences may be related. For example, we can transform or paraphrase sentences 
even if nonsense syllables are substituted for some constituents (e.g., given the 
sentence, “The wizzle lerped the kalaka,” we would identify “The kalaka was 
lerped by the wizzle,” but not “The wizzle was lerped by the kalaka,” as itS 
equivalent in passive voice). 

Figure 11-3 illustrates the relations among various transformations and com- 
binations of transformations. The base sentence on which the transformations 
operate is “Haste makes waste.” This base sentence provides the core material for 
transformations, and is called the kernel (in a stricter interpretation, even this 
present tense, active voice, positive, and declarative sentence is a transformation 
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from the raw material that makes up the sentence, i.e., the subject noun, “haste,” 
the infinitive verb, “to make,” and the object noun, “waste”). Possible transforma- 
tions include present to past or future tense, active voice to passive voice, 
declarative to interrogative (question), positive (affirmative) to negative, or any 
Combination of these. Even this list of transformations, which generates 
twenty-four unique sentences, is only partial (e.g., consider also changes from 
Singular to plural, as in “The cat pauses” and “The cats pause,” or changes in 
Person, as in “I am bored,” “He or she is bored,” and “They are bored”). Further- 
More, some transformations may be regarded either as simple transformations in 
their Own right, or as complex transformations based upon a combination of 
Simpler ones (e.g., the question “Does haste make waste?” might be a simple 
transformation of the kernel sentence “Haste makes waste,” or it might be derived 
indirectly from the intermediate emphatic transformation, “Haste does make 
Waste”), 

What advantage does speaking in terms of transformations have over simply 
labeling Sentences according to tense, case, and so on? With transformations, as 
With Constituent structure, we again face the problem of the behavioral significance 
(or Psychological reality) of these properties of verbal behavior. One interpretation 
S that the transformations correspond to something a listener or reader actually 

oes in responding to a spoken or written sentence. This is equivalent to saying that 
the transformation of a sentence from one grammatical form to another isa kind of 
havior Although such behavior may not ordinarily be observable directly, it 
Might be measurable on the basis of such properties as its duration or its effects on 
other behavior (e.g., Mehler, 1963; G. A. Miller, EE 

In one experi avin and Perchonock, 
Sentences 0 jee ee of eight words, as in the following: “The ball was 
Ht by the boy. Grass, cat, truck, hour, desk, rain, shirt, blue.” The transforma- 
‘onal complexity of ihe sentences and the list of words varied from one 
Presentation to the next. The listeners were instructed to repeat the sentence 
Verbatim after each presentation and then to recall as many of the eight words as 
PossibJe. on the average, the number of words recalled decreased with increasing 
transformational COTY of the sentences. For example, the mean number of 
Words recalled was 5.27 after a kernel sentence, 4.67 after a question, 4.55 after a 
ASSive Sentence, and 4.02 after a passive question. In other words, fewer words 
Were recalled after sentences with two transformations (e.g., passive question) than 

ter those with only one (e.g., passive), and fewer after sentences with only one 
Tansform tion than after those with none (kernel sentences). The finding is 
Consistent with the assumption that the transformational complexity of the 
Sentence limits the listener’s capacity to recall words from the list. 
1 But other factors complicate the interpretation (Glucksberg and Danks, 
62) W. A. Matthews 1968). Transformations are usually longer than kernel 
ntences (compare “The CAE hase the mouse” with “The mouse was chased 
ae Cat”). Even with sentences matched in number of words listeners typically 
ন less time to repeat kernel sentences than transformations of the kernel. Thus, 
results may depend on the time that repetitions of the various transformations 


1965), listeners were given 
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imposed between the end of the eight-word list and the opportunity to recall the 
eight words: the longer this time, the fewer the words correctly recalled. 

Other experiments demonstrated that kernel sentences can be more quickly 
memorized than transformations (e.g., Mehler, 1963), and that grammatical 
structure affects the likelihood with which sentences are accurately heard when 
they are masked by noise (e.g., G. A. Miller and Isard, 1963). A more direct study 
of transformation as a kind of behavior gave readers two lists of sentences (G. A. 
Miller, 1962). Each sentence in one list was related by a particular transformation 
to one sentence in the other list. The reader was given one minute to match 
sentences from the first list to corresponding sentences in the second list. For 
example, sentence pairs such as “John warned the small boy” and “The small boy 
was warned by John” were used to make up two lists that differed by the 
transformation from kernel to passive. The time per transformation was estimated 
from the differences in the average time taken per correct match in various pairs 
of lists. According to this analysis, transformations from kernel to negative, from 
kernel to passive, and from passive to passive negative took from 1.1 to 1.8 seconds 
each, those from kernel to passive negative took 2.7 seconds, and those from 
negative to passive took 3.5 seconds. The different durations are consistent with the 
number of transformations by which the various pairs of lists differed: the first 
three pairs differed only by a single transformation, whereas the remaining pairs 
both differed by two transformations. (The transformation from kernel to passive 
negative can be broken down into two parts, first from kernel to either passive or 
negative and then to passive negative; the transformation from negative to passive 
is assumed to be indirect, by way of negative to kernel and then kernel to passive.) 

These findings suggest that transformations, like constituents, have “psycho- 
logical reality.” We may recognize transformations as properties of verbal behavior, 
but we must not assume that transformations explain verbal behavior. We are able 
to describe some properties of combinations of transformations, as when we speak 
of a past-tense negative question in passive voice. We can note how transformations 
relate complex sentences to the simpler sentences from which they are derived (the 
Psycholinguist speaks of the relation between a terminal sentence and its kernel or 
kernels). We might extend our account to more detailed features of grammatical 
structure (e.g., psycholinguists sometimes distinguish between obligatory and 
optional transformations, depending on whether general relations such as subject- 
predicate or other more variable properties of sentences are involved). Describing 
the structural properties of sentences is the point of grammar, but describing a 
sentence does not say how it was produced or understood nor does its production 
or understanding require a judgment about its grammaticality. 

Hierarchical organization. Transformations can involve not only changes in 
the form of sentences, but also ways in which sentences or sentence parts can be 
combined with each other. For example, the sentences “The canary sang” and “The 
cat ate the canary ™ in combination can become “The canary that the cat ate sang” 
or “The cat ate the canary that sang.” These types of sentence structures are called 
recursive, because the addition of new segments can recur again and again. With 
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ক সক sentence, “The cat grinned,” the combination sentences could be- 
Sinan thar ee that fe cat that grinned ate sang” or “The cat that ate the 
beginning 0 0 grinned. Sentences can be expanded by adding phrases at the 
and tightrec ্‌ the end, when their structures are respectively called left-recursive 
SUUGtures ISive, or by adding parts within Or around the sentence, when their 
embedded aE called self-embedding and self-enveloping. With the self- 
enveloping sts ence “The canary that the cat ate sang,” we can illustrate a self- 
Uf iE was SE by expanding it to “The music that the canary that the cat 
embedding ot ee Whether we regard the structure of this sentence as self- 
Hlsle as Off pe EE depends in large part on whether we started with “The 

ey” or with “The cat ate.” In the first case, the hierarchical 


Str 4 
Ucture might be represented as: 


The music. . . .. . Was off key. 
that the canary ... sang 


that the cat ate 
In the 
S a 
cond case, it might be represented as: 


sax the:cat ate... » 


the canary that... ... sang 


The music that... . .. Was off key. 
whieh ayer Words, we may describe a sentence as a hierarchical structure in 
4 given com eatures or components are necessarily subordinate to others. Whether 
nothing to kN is higher or lower in such a description, however, may have 
Produced O with the order in which components occurred when the sentence was 
above Sto understood. For example, we might decide that one or the other of the 
as part টক es was an appropriate description depending on whether the sentence 
One that ত a conversation that began with “I'm glad that the music stopped” or of 
egan with “I’m glad that the cat ate.” We may note again that we do not 


explaj 
In ৰ 
4 sentence when we describe its structure. 
Summary 


shh nave examined some features of a formal analysis of syntax or grammatical 
dependena entences are sequences of discrete units called words, but sequential 
cannot de | alone cannot be the basis for structural descriptions, because they 
the Wous es relations among words that are independent of the separation of 
constituents a sentence. Sentence structure, however, can be described in terms of 
and their Ol transformations. In our account of a sentence, we classify words 

ations to each other, and we examine how the sentence is related to 
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other sentences that include the same or related words or word classes. The various 
methods of labelling or bracketing or diagramming sentence properties allow us to 
describe structural relations within and among sentences (as when we distinguish 
among the alternative structures of an ambiguous sentence). For any sequence of 
words, we must at least be able to classify its constituents and describe its trans- 
formational relations to other sequences. These properties of sentences may also 
have behavioral significance; to the extent that transformations describe what we 
do when speaking or listening or reading or writing, they correspond to the auto- 
clitic processes discussed in chapter 10. Experiments concerned with the psycho- 
logical reality of constituent and transformational structure illustrate how the 
behavioral effects of these dimensions of syntax can be studied. 


SEMANTICS: THE MEANING OF 
VERBAL UNITS 


To study semantics is to grapple with the problem of meaning. In everyday talk, 
we sometimes ask what people mean when they say certain things or whether 
one person has understood the meaning of someone else’s words. A point early in 
our treatment of verbal behavior was that the traditional vocabularies of meaning 
and reference can be misleading. How can we reconcile the widespread appeal of 
the concept of meaning in our everyday language with the ambiguities of that 
concept in a behavioral account? 

One part of the problem is distinguishing between the production of verbal 
behavior by a speaker or writer and its comprehension by a listener or reader 
(asking people if they mean what they say is different from asking them whether 
they understand what we mean). Even though correspondences between language 
production and language comprehension are ordinarily established in the course of 
language development, we must not assume that they necessarily go together. For 
example, a case report on an eight-year-old boy who could understand spoken 
English but was unable to speak because of a congenital organic defect (Lenneberg, 
1962) demonstrates that comprehension can be independent of production. In the 
development of the psychology of language, behavioral accounts have tended to 
emphasize language production whereas cognitive and psycholinguistic accounts 
have tended to emphasize language comprehension. The treatment of semantics in 
this chapter, therefore, stresses comprehension: what happens when someone is 
said to understand a word or a sentence? 

Let us begin with an experiment in which listeners heard various passages, 
such as an account of Galileo and the invention of the telescope (Sachs, 1967). 
After listening to the passage, they heard a sentence and were asked to say whether 
or not it had been in the passage. The sentence was either identical to one in the 
passage (base sentence) or differed from it in one of three ways: a change in word 
order that did not affect grammatical structure (formal change); a change in 
grammatical voice (active-passive change); or a change in meaning (semantic 
change). The sentence to be identified was presented either immediately after the 
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Original sentence in the passage or after 80 or 160 syllables of additional text. 
Examples of the sentences used with the passage about Galileo are the following: 


Base sentence—He sent a letter about it to Galileo, the great Italian scientist. 
Formal change—He sent Galileo, the great Italian scientist, a letter about it. 
Active-passive change—A letter about it was sent to Galileo, the great Italian 
Scientist. 

Semantic change —Galileo, the great Italian scientist, sent him a letter about it. 


The Passages and base sentences were varied across successive presentations and 
different listeners. 

When the sentence to be judged immediately followed the text sentence, 
listeners identified this sentence as identical to or changed from the original 
Sentence with better than 80% accuracy. When the sentence was presented after 80 
Or 160 syllables of intervening text, accuracy decreased for all sentence types, but 
Accuracy for the semantically changed sentence remained greater than 75% whereas 
that for the other sentence types approached chance levels. In other words, the 
listeners were likely to recognize the sentence as different only if the meaning had 
been changed (semantic change); as long as the meaning remained the same, they 
did not discriminate the changed versions (formal and active-passive changes) from 
the Original one (base sentence). The listeners Were not remembering particular 
Words or particular word orders. They were remembering something more funda- 
Menta]; whatever it is that sentences have in common when we say they mean the 
Same thing. 

Other experiments also de 


‘emember semantic structure than specific 
1974; Bartlett, 1932: Bransford, Barclay, and Franks, 1972; Fillenbaum, 1966; 


intsch, 1974). For example, in one study (Bransford and Franks, 1971) listeners 
ward a group of related sentences, such as “The ants were in the kitchen” and 
The ants ate the sweet jelly.” They later heard a selection from the original 
Sentences plus new ones. Some of the new ones combined the original sentences, 
AS in “The ants in the kitchen ate the sweet jelly.” For each sentence, the listeners 
Were asked to rate their confidence that they had heard it before. They were 
Usualjy more confident that they had already heard the new combined sentences 
An they were that they had already heard the simpler sentences that had actually 
Cen presented. The listeners had apparently learned something more abstract than 
Particular Words or particular sentences (but see Katz and Gruenwald, 1974, and 
Ang, 1977, for alternative interpretations). ্ টু 
These findings should remind us of the intimate relation between semantics 
Syntax. When we examined transformations, We spoke of the Psycholinguist’s 

gor SH of deep structure; deep structure is the name for those properties of a 
tha Soe that remain constant over various transformations. For example, we Say 
i “The teacher wrote the definition” and “The definition was written by the 
cher” gre the same or similar in meaning. This implies that what we hold 


Const Rg 1 
i of a sentence is its s 

ey When we change the syntactic structure emantic 

ure. 


monstrate that listeners are more likely to 
words or sentences (e.g., Anderson, 


The measurement of meaning 


The experiments so far have mainly considered sentences, but more attention 
has usually been given to individual words. What determines what a particular word 
means? In what ways can the meanings of different words be related? Word 
associations were the basis for some attempts to measure meaning (Galton, 1879; 
cf. chapter 10). If listeners produced longer and more varied lists of words in 
response to one stimulus word than to another, the first word was said to be more 
meaningful than the second. In addition, it was assumed that the more closely 
related in meaning two words were, the more the lists of associates occasioned by 
the two words would overlap. For example, common associations to both “infant” 
and “baby” might include “crib,” “bottle,” and “diaper,” but none of these might 
occur as responses to “guitar.” The different degrees of overlap among the lists of 
associates are consistent with what we already know, i.e., that “infant” and “baby” 
are closer in meaning than either is to “guitar.” 

Some words occasion more associates than others. For example, concrete 
nouns like “wall” or “road” or “building” are likely to occasion more responses 
consistently than prepositions like “of” or “to” or “at.” Given that responses 
may be occasioned even by nonsense words, the meaningfulness of such words 
relative to each other and to standard words in the vocabulary can be assessed 
(e.g., Glaze, 1928). Word associations are not equivalent to meanings, but to the 
extent that we say that words are similar in meaning when we respond to them 
similarly, such a treatment of word associations is appropriate as a measure of 
meaning. 

There can be ambiguities in word association. For example, if the word 
“night” occasions the sequence “day, morning, sun, moon,” should the later words 
in the sequence be regarded as responses to preceding words in the sequence or to 
the stimulus word itself? In this instance, we might guess that “sun” was probably 
occasioned more by “day” and “morning” than by “night,” but whether “moon” 
Was occasioned more by “sun” and “morning” (which shares three letters with it) 
than by “night” is unclear. The responses occasioned when each word in the 
sequence serves separately as a stimulus word might clarify such relations. 

An attempt to measure meaning without such ambiguities was the semantic 
differential (Osgood, Suci, and Tannenbaum, 1957). Words were rated along 
dimensions provided by word pairs, like “happy-sad,” “hard-soft,” and “‘slow- 
fast.” The similarities among the ratings of various words were then determined 
by a statistical procedure called factor analysis. This procedure created a space 
within which various words could be placed; the closer two words were to each 
other in this space, the more alike they were said to be in meaning. Words like 
“good,” “beautiful,” “clean,” and “pleasant” tended to cluster together in this 
space, and to be distant from other clusters such as “bad,” “foul,” “dirty,” and 
“ugly.” 

The semantic differential was intended to deal with all the words in the 
lexicon. More recent efforts based on word ratings have concentrated on words 
within specific categories, such as color names and the vocabulary of the emotions 
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(e.g., Fillenbaum and Rapoport, 1971). Still another technique, called componen- 
tial analysis (Romney and D’Andrade, 1964), is based not on word ratings but 
rather on an analysis of the dimensions that distinguish words within particular 
Word groups. For example, kinship terms in English are based upon sex (mother- 
father, sister-brother), generation (grandmother-mother, son-grandson), and line of 
descent (mother-aunt, son-nephew). The problem is that only certain word groups 
lend themselves to such an analysis (e.g., to what dimensions might the language 
of facial expressions be reduced?). Verbal classes with such ill-defined boundaries 
have been called fuzzy sets (e.g., Rosch, 1973). 

Each method describes semantic relations among various words, and meaning 
might be defined simply as the relations measured by these methods, whatever their 
basis. This conclusion, however, seems unsatisfactory, perhaps partly because such 
definitions give us little to say about the behavioral significance of meanings. We 
already discussed meaning as a property of our responses to words. For example, 
if a familiar word like our own name is repeated over and over, we sometimes 
remark that the word loses its meaning; this implies that some response that occurs 
to the word at first drops out after several repetitions. | 

We have noted that the language of meaning can sometimes be misleading. 
For example, we are tempted to say in treating the preceding examples that we 
tespond to the meaning of words. But meanings are not properties of the words 
themselves; they are properties of our responses to the words. A written word in a 
language that no one understands has no meaning. When a word is said to have 
Many meanings (as in “bat” in a belfry and “pat” at a baseball game), it is the same 
Word in its different contexts; the meaning of the word changes only in the sense 
that We respond to it differently. Consider saying that a red traffic light means 

Stop” and a green traffic light means “go.” When we stop on red and go on green, 
he Ordinarily say that we are responding to the red light or to the green light; we 
0 not ordinarily say that we are responding to the meaning of the red light or to 
he Meaning of the green light. | 
হ্‌ One implication of treating meaning a: 

Ay be able to measure some of its properties. 


as a feature of verbal behavior is that we 
For example, we might see whether 


€ time taken to make each judgment was recorded. The true and false sentences 


Ee Constructed from an assumed semantic stru br 2 
egories defined by sets of relevant properties. An example of such a ierarchy is 
lb Own in Figure 11-4. The assumption is that membership in each class is defined 
oth by a set of distinctive characteristics and membership in the next higher class. 
«OT example, in this structure “canary” is defined by the properties “can sing” and 
Ss yellow” ang by membership in the class “bird.” According to this account, the 
‘Me taken for the judgment that a sentence is true should depend on where in the 
lerarch; tic is located. For example, of the 


Cal structu j characteris 
5 re a given 
ntences, “A canary can sing,” “A canary has feathers,” and “A canary breathes,” 
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the judgment should be quickest for the first sentence and slowest for the last. In 
the first case, the property “can sing” is a characteristic of canaries; in the second 
case, the property “has feathers” is a characteristic of being a bird, which in turn 
is a characteristic of canaries; in the third case the property “breathes” is a charac- 
teristic of being an animal, which in turn is a property of being a bird, which in 
turn is a property of canaries. (The structure allows for cases in which the 
characteristic property of a class may be incompatible with the properties that 
define the higher-level class of which the first class is a member; for example, 
because “can’t fly” is a characteristic of penguins, judging the truth of “Penguins 
can’t fly” does not depend on first judging whether penguins are birds.) 

Judgment times were consistent with the types of hierarchical structure illus- 
trated in Figure 11-4. Pressing the “true” button occurred a longer time after 
sentences like “A canary has skin” than after sentences like “A canary is yellow.” 
But the hierarchical structures upon which the various sentences were based were 
constructed by the experimenters, and did not take into account such factors as 
the relative sizes of the different classes (e.g., Collins and Quillian, 1970; Kintsch, 
1974; Landauer and Meyer, 1972; Oden, 1977; Wilkins, 1971). More important, 
later research (C. Conrad, 1972) showed that the different judgment times 
depended largely on the different frequencies with which particular word relations 
occurred in the language (e.g., “canary” and “yellow” and “sing” are more likely to 
occur together than “canary” and “breathe” and “‘skin”). The Collins and Quillian 
experiment stands mainly as a potentially useful methodology; it cannot be said 
that it has measured differences in semantic responses to words. 

Yet our failure to provide an explicit definition of the word meaning does not 
imply that meaning is meaningless. The word is, after all, part of our verbal 
behavior. When we speak of what a word means, we are presumably tacting some- 
thing about the relation between words and behavior. We may recall that an 
independent specification of what is tacted is not essential to the definition of a 
tact relation (cf. chapter 10). We have at least shown, however tentatively, that 
semantic structure can be described and measured (see also Kolers, 1966; Tulving, 


can move around 
Animal < breathes 


eats 
has fins 
En can swim 


has gills 


has feathers 
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Figure 11-4. Hypothetical semantic structure with three hierarchical levels. The 
classes at each level are characterized by sets of properties that define the members 
of each class. Only a few sample properties are shown for each class. (Adapted from 
Collins and Quillian, 1969, Figure 1.) 
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develops- If we show that human verbal behavior always has certain structural 
properties or develops in certain ways, we have still to answer functional questions 
about the circumstances in which verbal behavior occurs (similarly, an anatomical 
account of the different properties of flight in birds and bees and bats would not 
tell us where or when one of these organisms would take off or land). 

The issues have become more complex with demonstrations that chimpanzees 
can engage in verbal behavior. Early attempts to establish language in chimpanzees 
were unconvincing because they concentrated on language in the speech mode; the 
chimpanzee’s capacity to engage in differentiated vocal behavior, however, is 
limited. Successful demonstrations of verbal behavior in chimpanzees came only 
after investigators switched to languages based on other types of verbal responses, 
such as gestures (R. A. Gardner and Gardner, 1969) or the manipulation of objects 
(Premack, 1970) or button-pressing at a computer console (Rumbaugh, Gill, von 
Glasersfeld, Warner, and Pisani, 1975). Debates then arose over the comparability 
of these and natural human languages. Once some features of language had been 


demonstrated in chimpanzee language, the feature could no longer be regarded as 
uniquely human (cf. Linden, 1976). 


Language development 


The nature of language development in humans can have important implica- 
tions. For example, to demonstrate that some language skills differed from others 
in the extent to which they had to be explicitly taught presumably would affect 
educational practice. Most children, however, become fluent in their native 
language long before they begin formal education. Although there are many 
Eo fl je development of children’s language with age, we have much to 
earn about the factors critical to that dev ; i 
1970; Segal, 1975). lopment (cf. R. Brown, 1973; McNeill, 

An important feature of language is its novelty; when we write a sentence, it 
is likely to differ from any other sentence we have already written. We can deal 
with such novelty only in terms of the features that the new sentence shares with 
our earlier sentences; in other words, we must speak in terms of syntactic and 
semantic classes (cf. Esper, 1918). Such an account is appropriate even at the very 
early stages of language development. 

An illustration is provided by Berko (1968). Sentence completions of 
children ranging in age from four to seven years were compared with those of 
adults. The children were shown and read an illustrated sequence of sentences that 
introduced a nonsense word. The sentence to be completed was designed to prompt 
a different grammatical form of the nonsense word. For example, a sequence that 
occasioned a plural form was: “This is a Wug. Now there is HC া There are 
two of them. There 2 two _____.” The regular plural in spoken English i 
formed by -2 (as in “dogs”), by -s (as in “cats”?), and by -es (as in “horses”), 
depending on the sound with which the spoken singular ends (e.g., -z after voiced 
endings such as Dp, t, f, and -s after unvoiced endings such as b The children 
fairly consistently produced standard plurals with -z and 6 ee children 
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Tesponded wugz to the sample sequence above). Even though almost all of them 
Were able to give glasses as the plural of glass, only about one-third gave the -es 
ending for the plurals of novel words like tass and gutch and nizz. Thus, various 
ASpects of the standard plural usage in English appear to develop successively over 
time. Children may at first learn specific words and specific word sequences, but 
then they begin to master larger units such as particular classes of transformations 
Te Braine, 1963; Guess, 1969; Palermo and Eberhart, 1968; Palermo and Howe, 
70). 


Other evidence that the acquisition of language is a progression from 
Particular words to simple word groupings and then to syntactic and semantic 
Structures of increasing complexity is in the usage of irregular verbs, such as “go, 

Come” or “break” (Slobin, 1971). Among young children, the regular but non- 
Standard past tense forms are common: “goed,” “comed,” “breaked. Children 
do not usually start with these regular forms, however. Instead, the child first 
ans the standard but irregular forms as individual words: “went,” “came, 

broke.” Only after the child has learned some standard regular past tense forms 
the irregular forms displaced by the regular but nonstandard ones, even though 
gr liregular forms had been part of the child’s vocabulary for some time. he 
a Years later, the standard irregular forms reappear to become permanen 

Mponents of the child’s verbal behavior. The progression from standard irregular 
cop onstandard regular and back to standard irregular past tense CR is 
se ent with a progression from the mastery of individual words t k 
la cotyped and inflexible syntactic form to the varied structure of lluen 
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TWELVE 


Verbal learning procedures include serial and paired-associates learning, free 
recall, and the special case of verbal discrimination called verbal recognition. 
Verbal transfer and the phenomena of proaction and retroaction involve 
Ways in which different verbal learning tasks affect each other. Many of these 
terms are closely related to words considered elsewhere: associates to conse- 
quence; recall to class; recognition to cognition; and transfer to differentia- 
tion. Serial, through the Latin serere, to arrange Or attach, is related to series, 
sort and, perhaps as an arrangement of words, sermon. Proaction and retro- 
action combine the prefixes pro-, forward, and retro-, back, with a deriva- 
tive of the Indo-European root ag-, to drive. This root is the source not only 
of the behavior word act, but also of such words as agent and examine. 


The area traditionally called verbal learning is concerned with what happens as we 


earn word sequences, word combinations, and word contexts. These relations 
D 


‘Mong words correspond to the intraverbal relations of chapter 10. In fact, the 


Study of verbal learning might be defined as the analysis of intraverbal learning. 
his chapter examines the four major classes of verbal-learning procedures: serial 
eamning, paired-associates learning, free recall, and verbal discrimination. The litera- 
lure of verbal learning is extensive (e.8., McGeoch, 1942; Postman and Keppel, 
969; Tulving and Madigan, 1970); we will illustrate each class of procedures with a 
ample of some characteristic findings and some experimental and theoretical issues 
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VERBAL LEARNING 
PROCEDURES 


In a typical verbal learning procedure, we present verbal stimuli to the learner and 
then record verbal responses. Much is taken for granted in such arrangements. If we 
were to describe them simply in terms of verbal discriminative stimuli and differen- 
tiated verbal responses, we would be omitting some significant features. For ex- 
ample, the learner’s performance is only rarely established by differential conse- 
quences; instead it is usually established by instructions. The learner who is in- 
structed to learn particular items will ordinarily learn those items more quickly or 
more accurately than one not so instructed (this difference has been summarized by 
saying that intentional learning is superior to incidental learning; cf. J. G. Jenkins, 
1933). Furthermore, the consequences of responding may be hard to specify. The 
experimenter designates responses as correct and incorrect or right and wrong. 
When the learner responds, the experimenter may say “right” or “wrong,” or the 
learner may simply be shown the item designated as correct. These procedures are 
sometimes said to provide the learner with feedback or with knowledge of results. 
It is tempting to assume that telling or showing a learner that a response was correct 
is a reinforcing consequence, but such feedback can also be regarded as instruction 
(i.e., telling the learner that a response was correct is equivalent to saying, “respond 
the same way next time”). It may therefore be misleading to speak of the rein- 
forcing properties of being correct or the punishing properties of being incorrect. 

A variety of other circumstances may influence the learner’s behavior. For 
example, it may make a difference whether the learner is a volunteer or serves in 
the experiment for hourly wages; if the experiment ends as soon as a list is learned, 
the consequences of finishing early will be different for the learner satisfying 2 
course requirement than for the learner whose pay depends on the total time spent 
learning the list. These and other features of the experimental setting that deter 
mine the learner’s performance are sometimes called demand characteristics (Ome, 
1962). In verbal learning, we will mainly treat verbal stimuli and verbal responses; 
we will not be much concerned with the consequences of verbal learning. In some 
cases, it may be important to note such consequences, but usually we may assume 
that the consequences are fairly uniform throughout an experiment even if we 
cannot specify them. 

Verbal stimuli and responses have ranged from such simple items as numbers 
or the letters of the alphabet through nonsense syllables or words to such complex 
materials as sentences or extended texts. Particular materials will be discussed in 
the context of particular experiments. Here we will only distinnguish between 
nominal stimuli and functional stimuli. The nominal stimulus is the verbal item aS 
defined or presented by the experimenter; the functional stimulus is that feature of 
the verbal item that occasions the learner's response (cf. attention: chapter 7). For 
example, if the stimulus items in a learning task are the four 3-letter sequences 
PDQ, TLC, VIP and ETC, but the learner's responses are occasioned only by the 
first letter of each sequence, then the three-letter sequences are the nominal stimuli 
and the first letters of each are the functional stimuli. 
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Verbal learning experiments have used verbal stimuli in both written and 
vocal modes, but written stimuli predominated in the early days of verbal learning 
research. The advantages of written stimuli are that they are uniform, easily 
described, and can be presented at well defined rates and durations. A popular 
device for presenting verbal stimuli was the memory drum, illustrated in Figure 
12-1. Words were typed or printed on a paper loop or tape, and the roller advanced 
Successive items to the window where they were presented to the learner at a con- 
trolled rate. In contemporary research, slide projectors or computer displays have 
replaced the memory drum. In early verbal learning research, vocal verbal stimuli 
Were less favored because their uniformity and rate of presentation were 
harder to control. For example, an experimenter who had to read a list to 
several different learners might change the inflection or loudness of particular 
Words over successive readings. The advent of tape recordings reduced the magni- 
tude of this problem, and contemporary research uses either written or vocal verbal 
Stimuli, depending on the suitability of one or the other to a particular experiment. 
Similarily, the choice of written or vocal responses varies with the requirements of 
Particular experiments. Learners can usually respond more quickly vocally than in 
Writing, but the vocal response has to be recorded whereas the written response is 
its own record. 

~The major classes of verbal learning procedures are summarized in Table 12-1. 
name implies, serial learning is the learning of a series or sequence of verbal 

, as when one learns to count or to recite the months of the year. A series can 

€ learned in a variety of ways. For example, suppose a memory drum is used to 


teach the first ten presidents of the United States. The drum could present each 


Name for five seconds, and the learner could be asked to recite the entire list in 
s 


FR order only after all ten names had been shown. This procedure, called serial 
ecall, could then be repeated until the learner had given one or more correct repe- 
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TABLE 12-1 


Classes of verbal learning procedures. 


Name 


Description 


Examples 


Serial learning 


Paired associates 


Free recall 


Verbal discrimination 


Verbal recognition 


Learning the items of a 
verbal sequence in a 
specified order. 


Each of several different 
verbal stimuli occasions a 


particular verbal response. 


Each stimulus with its 
associated response 
constitutes a pair; the 
order of pairs is 
unspecified. 


Naming the items in a list 
without regard to order. 


Items in one class of 
verbal stimuli occasion a 
different response than 
items not in the class; in 
other words, a discrimin- 
ation in which the 
stimulus dimension is 
verbal. 


A special case of verbal 
discrimination. The 
property that defines the 
class on which the 
discrimination is based is 
whether the verbal item 
appeared in a specified 
earlier list or context. 


Learning to recite the 
alphabet; learning a poem. 


Given a state name, naming 
the state capitol; given a list 
of words in another language, 
providing the English 
equivalent for each. 


Naming the members of the 
Supreme Court; naming the 
States. 


Identifying the nouns in a 
paragraph of text; given 
names on index cards, sorting 
the cards into one stack of 
female names and another 
stack of male names. 


Given a list of names, saying 
which were from one’s high 
school graduating class; 
distinguishing between old 
and new technical terms 
while reading a text. 


titions of the list. Alternatively, as each name appeared in the window, the learner 
could be asked to say which name would appear next (e.g., on seeing Jefferson ke 
say “Madison,” and then on seeing Madison to say “Monroe,” and so on). This 
procedure, called serial anticipation, could also be repeated until some number of 
correct repetitions. Once a list had been learned by either procedure, it might be 
asked whether the learner could answer various questions about the positions of 
names in the list, such as “Who was the ninth president?” (Harrison), “Which presi- 
dent was Tyler?” (tenth), or “Which president came before Van Buren?” (Jackson). 
The roles of context and position are illustrated by a common well-learned serial 
list, the alphabet. Even though we all can recite the alphabet and can probably 
say quickly which letter precedes or follows any other letter, few of us can give 
without counting the eleventh letter of the alphabet or the numerical position of 
the letter S. 
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In the sequence on the left, the designation of correct items is arbitrary; on the 
right, the correct item of each pair is the one with the letter J. An alternative 
procedure is to present items one at a time, with the learner indicating (e.g., by 
“yes” or “no”) whether or not each item is in the designated class. As with other 
verbal learning procedures, presentations may continue until the learner meets some 
criterion of discriminated responding. 

One type of verbal discrimination is that between new and old items, as when 
a student in a language course distinguishes words already learned from new vocab- 
ulary words. This is called verbal recognition. The learner is given an initial list. 
Later, the learner is given another list with both new items and items from the inital 
list. The learner must identify (recognize) the items from the initial list. Except that 
it uses verbal materals, verbal recognition shares much with familiar instances of 
recgonition (e.g., as in recognizing friends in a group of people, or in recognizing 
places one has visited). All of these verbal-learning procedures have been applied to 
issues in the analysis of such areas as language and memory. We now consider each 
in more detail. 


Serial learning 


Verbal-learning procedures were an experimental realization of the associa- 
tionist principles developed by such philosophers as David Hume and James Mill. 
The associationists had advocated that human thought was based upon the associa- 
tion of ideas. Ideas were said to become connected or associated in various ways 
(e.g., similarity, common elements, contrast), but especially through contiguity in 
space or time. Later, with the beginnings of modern chemistry, analogies were 
drawn between the formation of associations and the chemical combinations of 
atoms into molecules. Ebbinghaus (1885) saw the possibility of measuring the for- 
mation of arbitrary associations. He invented the nonsense syllable as an item that 
had not yet acquired verbal functions in the language and therefore would not be 
contaminated by associations that had already been established (but it was later 
shown that nonsense syllables did vary in meaningfulness as measured by word- 
association procedures: Glaze, 1928). 

The typical nonsense syllable was a three-letter consonant-vowel-consonant 
sequence (a CVC trigram). Ebbinghaus constructed many such trigrams, excluding 
those that were already words in the language, and then arbitrarily made up lists of 
various lengths to be learned at a later date. Over a period of years, he learned the 
various lists he had constructed, recording such data as the number of repetitions 
required until he was able to reproduce a list without error. For alist of up to seven 
syllables, he required only a single reading for a correct reproduction; beyond that 
point, the repetitions required for a correct repetition increased with list length, UP 
to about 55 repetitions for a 36-item list. Later research (cf. McGeoch, 1942) 
showed that not only total learning time but also the learning time per item 
increased with list length. Another of Ebbinghaus’s findings was that meaningful 
material could be leamed more rapidly than nonsense syllables. In contrast to the 
55 repetitions required for a 36-item list of nonsense syllables, Ebbinghaus was able 
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to learn 80-syllable stanzas of poetry (from Byron's Don Juan in English) in about 
eight repetitions. { 

Serial-leaming procedures, together with other verballearning procedures, 
Provided the baseline for studying various factors that influence human learning. 
For example, the finding that spaced practice is typically more effective than 
massed practice (e.g., Underwood, 1961) is often cited as a basis for arguing that it is 
better to distribute study evenly throughout a semester than to cram it all in just 
before examinations. This finding is so familiar that it is sometimes difficult to 
Tecognize that early in its study it was regarded as counterintuitive. It Was oe 
that the massing of successive learning trials should allow less opportunity for the 
lamer to forget items from one presentation to the next than if the trials NE 
Spaced in time. But this was not the case, and the superiority of spaced over masse 
Practice eventually contributed to accounts of verbal learning in terms of inter- 
ference between items (the superiority of spaced over massed practice, however, ls 
Much more evident in motor-skill than in verbal learning; Adams, 1954). Ae 

Other variables studied include meaningfulness (lists of words rated high in 
Meaning fulness are learned more quickly than lists of words rated low), hE 

ility (lists of items that are easy to pronounce are leaned more quickly ডা En 
Of items that are hard), redundancy (lists of items organized in some way are Eo 
more quickly than random lists), similarity (lists of dissimilar items are ” 
More quickly than lists of similar items, where similarity 1s measured Ln Ea 
common Jetters or other shared elements), and so on (e.8., McGeoch, ES 

iller, 1958; Underwood and Goad, 1951; Underwood and Schulz, 19 os 
“alalog of the factors that affect verbal learning (e.6., learning as a Ae ই 
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and comparing the time taken to releam each list when the order of the items was 
the same as or different from that of the original list. This was called the method of 
Savings: the list was ordinarily learmed more quickly on the second day than the 
first, and the effects of changing the order of items were measured by the different 
amounts of time saved on relearning. Ebbinghaus studied savings with 16-item lists 
in original, scrambled, and reverse orders, and with lists of every other, every third, 
every fourth, or every eighth item (e.g., in a list based on every third item, the order 
4BCDEEFGHbecomesADGBEHCEP.As shown in Figure 12-3, the greatest 
Savings in relearning occurred with items in the original order, and the least with 
tems in the scrambled order. The reversed list produced more savings than any of 
the remaining ordered lists. From these data, Ebbinghaus concluded that both 
emote and backward associations had been formed during the original learning of 
each list, 

| We have not defined associations, but let us assume for the moment that they 
Simply correspond to discriminative control of a verbal response by a verbal stimu- 
US. We might then say that once the list had been learned, a given item might occa- 
Sion not only the next item but also the preceding one or the one after the next. If 
that is the case, serial learning may not be the way to study associations. It may be 
Appropriate instead to examine associations independently of a particular serial 
order, Serial learning might then be regarded as a special case of paired-associates 
Caming. For example, consider the serial learning of the following S-item nonsense- 
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iS Te 
syllable list: NAJ, BEF, LUJ, PES, CED. In the presense of NAJ the LE 
Structed to say BEF; in the presence of BEF, LUJ; in the presence of LUJ, a 
in the presence of PES, CED. These same relations could be established in দয) be 
associates list consisting of NAJ-BEF, BEF-LUJ, LUJ-PES, and PES-CE Ne 
sumably the leamer who mastered the original serial list would quickly lear 
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T to have been developed independen 


a g her 
he earliest experiments used colors rat 
the verbal 


» 1896), and paired-associates Rl 
d the effects of a variety of verba 
Properties (e.g., meaningfuln 
Stockwell, and Noble, 1958). 
The learner's task Appears simpler with a paired-associates list than with a 
Serial list: each stimulus item Occas: 


k ট of 
Ions a unique response item, and the order 


are before they can be O90 
sioned by their respective stimuli. These Components of paired-associates learning 
(or response availability) and association 
(Underwood and Schulz, 1960). 


Consider the following three Paired-associates lists: 


DOG - XUW 


DOG -18 DOG -3 
ANT - WIX ANT -21 ANT -1 
CAT - VUW CAT =73 CAT -4 
FLY - XUV FLY -85 FLY -2 
BEE - XOvV BEE -46 BEE -6 
COW - XIW COW - 92 COW -5 
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5 bly be re- 
ষৰ se integration would presumal 
ly Lo be TE learner’s ou eS than 
quired with the second list, ould probably at first include One BGAN 
theless, the learner's We 7 86, 93). We would say that Ee HEHE SPONSE 
OEE ODO Ee ত resHOnses included all the num i were not yet 
1S complete when ng te ccur even while some stimulus ea tetiinitial 
‘st and no others. a টী 8 items. Finally, the last list “E included only the 
enc LE EDRrORAAL oC The Garnet knew that the TOSPOASE sco tlonss i.e., the 
SEONG DOAN: Once Ee learning would comprise oR Sj elassoniales learn- 
digits | through ie ORR TETE and response es might assume that 
nL A first list than with the last js or availability rather 
i ৰ নট EE ro pl ALE of response integration 
€ difference was mai | HONE: 
than one of association. from response integration, loan EE 
Even after it is separated Dele a single RSE a of association 
ins complicated. Presented LG What extent, then, is the A ediokelher at one 
op be eared. very Tn pairs are presented Loe uo maze learning, in 
plicated JERE Y DERE HE to those in the history ating in the T-maze 
VE? (The circumstances are rn ler and simpler forms, ae studies on learning 
apt mazes evolved through 1 ) These considerations eg i single setting (e.g., 
se EC BIEOT SVE a to learning many 0 the psychology of 
ly Ee fe a Geldridled a lee tiot ET Senin was whether learn- 
Stes, 1964). The issue os ntroversy. The q Kk be 
“8 occurred gradually 2 established a little at a time 0 eight CVC-trigram 
“one fashion. Were RE 1957), a learner was SHOW hi timulus items 
In one experiment (Rock, ortunity to respond to the EE correctly 
Pairs, and then was EXOD: #0 cn until the learner was able ন: or the pairs in 
alone. This procedure was TOE group of learners, only the or ng rOup, a pair 
ol Ele resentation to the next. Ina Be oe stings 
the list was changed from one Te learner had responded PY d. In other 
Was retained in the list only ed and a new pair was substitute ঠি 
item; otherwise that pair was nl pairs retained over successive eat tn 
Words, for the second group, ne one presentation. The two groups earne their 
cfs ne Uwe tose leaned al rates; learning was as rapid when A 
Cight-pair lists at roughly equ repeated over successive presentations. চ rials 
Substituted as when i not yet occurred did not seem to contribute to 
On which a correct respo! 
learning. irs of CVC trigrams are equally difficult to learn. For 
Unfortunately, not all Bairs:C CEN-TER, which forms a common word, will 
eXample, a nonsense syllable pair like air, like NEC-RET, that does not fonn a 
be more easily learned than a bine? nt was that the procedure for the second 
Word. The problem with Rock’s VEE JR ESEipairsr lial WEE onic Ls les 
Eroup was also a method Hi ILD were retained, and more diffi- 
Sn EE When the experiment was repeated with CVC trigram Palrs 
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1962; see also Taylor and Irion, 1964). a 
least sometimes, associations can be Un 

Single trial. The reader who reads the pair OWL - NO, and then covers the p 

will certainly be able to respond a i 


sequencing or a ea 
ther issues relevant to t 


8 Of others are Considered in the section on 


€: 


-associates leaming in terms Of verbal stimuli that occa- 
Sion verbal responses Superficially s 


eems adequate. But, as with serial learning, re- 
An account, In serial learning, this was because 
Verbal sequences can be formed in i Order response units that are not re- 
nd responses (e.g., in typing or writing the word 

tas a stimulus: cf chapter 6). Similar prob- 
lems exist in paired-associates learning. Consider the following paired-associates list: 


MON - CAR SAT - YET 
GOL - BUT TAZ - BUS 
FRI -CAB 
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a ily 
is list has learned main 
who masters this or 
We might assume that the learner ES 
iets, In such experiments, EE ৰত Tales elven 
f lot of other : 
te ন EY i tems are nonse 
the It: tts he pe note that only three nee response items are 
lables; the others are abbreviations for days of the ie ED aa 
concrete and might lead the learner to compose LL EOS OT CE The fear 
bines them These and other factors may affect he bhp ae 
masters the list We have emphasized the rote ne eat A FE 
it | op 0 i havior more 
and res ut paired-associa j Be 
ties of সিল লা” CT many pairs of items, and may 
Varied than the designated response tenn ated HEE EL sims cess 
চহ seat) ing is established by instruc- 
Sions ও 3 seed pn Typically, such ET {পূণ * 
tions ther than by differential UE hen Satins controls 
: টা ED i etely arbitrary 
পা রক” লা verbal learning that is not completely 
টপ oh i iations. 
i SOCia 
will inevitably be more than the learning of as 


Free recall 


ired-associates list has 
One demonstration that a learner who has Et EE some of the 
learneq OS Than associations is that the EE if asked about the last 
item টী timulus item is presented. or der might recall that the stim- 
fee tf Ea revious section, the reacer m is unimportant, this 
সিখন পাশত 7 and SAT. If the order of items is ’ 
§ Items include 2 £ 
‘Ype of performance is called be পিল with nonverbal কক EE 
i can be i Val, ask someone to 
Wwe Rec hone ee of objects and ক "এল 2৯! OED 
Name the Heath in the collection. A common in ome StHEHSEIL Gathering. As e 
Asks us to ae the people we met at a party or 5 ed for procetures Withlists of 
technical term, however, free recall is usually reservi 
Verbal items. | it becomes obvious that a major concem L bE ন! 
ee THe defining characteristic a fran an CEPOL for the 
Some time intervenes between a discriminative stim ydlly present ESE of ees 
Tesponse that it occasions. Free-recall procedures us the items without th 
ity for the learner to name E € items withou e 
Once, followed by an opportunity t trial of a serial-learning procedure, except 
list. Thus, free recall resembles the firs e the items in their original order (Re. 
that the learner is not OR palred.assciates lists, with variations called 
Search has also examined re ট Underwood, 1959). 
modified and modified modified free লে লামলন Ue EL ) 
Given its similarity to serial-learning P Le 
that serial-position effects occur in free recall 4s We'’. Learners are 
recall items at the beginning and at the end of a list (Murdoch, 1962) 


be Surprising 
most likely to 
- These effects 
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he principle of 
fre often described using two Principles: primacy and recency. The Pp 
primacy states that th: 


jt ay in 
An item in a list is Only one of several factors that 5 

fluence its likelihood of recall. Consider the following SO-item list (after readi 

list, the reader may wish to Cov 


DOG 


SEAT ELEPHANT ACKS 
B TR 
Shanon Ono Bie Bean ac 
WIGWAM BUS 
FEAL VEEROKEE wor RUTULE ~ PUEBLO 
ORANGE YATCH ely CHEYENNE? ESE EK 
MOHAWK SAY TRAIN TIME HIDE 
FATE PRNTY Fox SKIN LION ট্ 
দি SE 5 5 POOSE 
APPRIOBILE APACHE HANDOLE CHANCE PAPOOSE up 
4 LEOPARD GRAPE CORN MOCCASIN 
Given the len 


items. Most learners il 
the end of the li M VE Se 


ill 

্‌ Tete CASIN). In addition, most learners Se 

the distinctive item i ই i ET eater likeli 

hood of recall of unusual items in a sn eis alles ee EE effect 

(von Restorff, 1933). It Car Monstrated With a variety of distinctive far 

8., a word in red e Of standard print) or size (e.g.,a ba 

; SO be effective (e.g., a single COMMON ডা 

eal names), Te Problem is that it is sometimes Co r 

Be distinctive nthe context of a 

j S much greater for one item on a list than en 

ight then be tempted to say that the item must have ne 
not explain “nything; instead, it simply demonstrates © 

an item distinctive. 


wi 
Probability 
neighboring items. We m 


distinctive. But this does 
condition in which we call 
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A drawback of attributing properties like distinctiveness or salience to stim- 
ulus items is in the implication that the important dimensions of free recall are in 
the list rather than in the learner. But the learner is not passive; the leaner behaves 
With respect to the list. For example, in free recall learners typically rehearse recent 
items, by repeating them vocally or subvocally. In one procedure (Rundus, 1971; 
Rundus and Atkinson, 1970), items from 20-item lists of words were presented for 
5 seconds each, and learners were instructed to rehearse aloud. Their rehearsals were 
tape-recorded. The learner had more opportunities to rehearse early items than later 
Items (e.g., the fourth item could be rehearsed during presentations of any of the 
remaining 16 items, but the sixteenth item could be rehearsed only during presenta- 
tions of the last 4 items). As illustrated in Figure 124, higher recall probability of 
early items (primacy effect) was correlated with the more frequent rehearsal of 
those items. The recency effect, however, Was not correlated with rehearsal; recall 
Probability increased for items at the end of the list, even though the opportunity 
to rehearse those items was limited. What the learner recalls may depend on what 
the learner does during and after the presentation of each item (perhaps a distinc- 
tive item is an item very likely to be rehearsed). Fo 

What a learner might do during free recall is not limited to rehearsal. We 
noted that the difference between serial recall and free recall is simply in the in- 
Structions to the learner. In serial recall, the learner is instructed to name the items 
'n their original order; in free recall, the learner is not told that order is important, 
Or perhaps is specifically instructed to ignore order. But this does not mean that 
items in free recall are named in arbitrary orders. In fact, the order of recall of 
lems often differs systematically from the original order in the list. Consider the 


12 


1.0 
Rundus and Atkinson (1970) 


I নু 
RL Recall 5 
5 [ 
2 6 Fs 
El [4 
» AF- 5 
E 2 
[- 
Ee aS 

oe Rehearsal 


Citi iio 
০্বতও ৰ চ67 891011121314151617181920 


Position of Item on List 


Figure 12-4. Relatio be 
্ n between number of rehearsals and probability of 4 
of twenty nouns were presented at a rate of five seconds per item. CEE Lists 
Ee to rehearse aloud, and were allowed two minutes of free r eIS were 
prese! ion of the list. (Adapted from Rundus and Atkinson, 1970 ) after 
» “igure 1.) 
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SO-item list Presented earlier. Word: 
throughout the list (e.g., American 


| 
tc.; trave 
MOCCASIN, etc.; animal Anes: DOG, CAMEL, ELEPHANT, TIGER, e 

Words: TICKET, AIRPORT 


Category are likely to be 


dimensions Of verbal ite 
related in s 


ith 
Ope with repeated recalls out 
“A. K. Bousfield and Bousfield, 
. It remains clear that the lea i 
lists. The Structuri i i 


Verbal behavior. 


Verbar discrimination 


the incorrect class. Verbal 
example, the leamer mi, 


5 . For 
© be established successively 
to respond “yes” or « 


Ms one at a time, and be Th otuotec 
tematively, the learner might Hors 
MSs printegq on them into two or eone 
On is visual search, as when som 


bal ite. 
to each item; al 
ds with Verbal ite, 
verbal discriminati 
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Eee to find verbal items in a list or in a text (e.g., Healy, 1976; Neisser, 
word in Ta 3). Examples are finding a name in a telephone book, looking up a 
ments i 2 dictionary, and checking a manuscript for typographical errors. Experi- 
they [e visual search differ from the other types of verbal discrimination in that 
tontor usually concerned with speed and accuracy rather than with the acquisi- 
control by stimulus items. 
Bilis 5? Simultaneous verbal discrimination, the Proce j 
ls items and their assignment by the experimenter to the correct or incorrect 
SDE ees arbitrary (e.g., pairs of nonsense syllables, with the items to be 
Epon chosen randomly). With such lists, the acquisition of verbal discrimination 
ghey S On such factors as whether some items appear in the list more often than 
Such (Ekstrand, Wallace, and Underwood, 1956). For example, a list in which 
rect ee appears only once is learned more slowly than a list in which some cor- 
Ms are repeated. 


Parti Once verbal discrimination is established, on 
icular response and the other does not. But the discrimination is arbitrary, and 


on Ee be relevant to the verbal discriminations already established in our every- 
ova anguage (e.g., discriminations among nouns, verbs, and other parts of speech, 
Ong semantic dimensions, or along dimensions of alphabetical order or spelling). 
Gn experiments have explored the acquisition of verbal discriminations in which 
ey feature is always a property of the correct item. For example, the correct 
tio might be defined as an item that includes a particular letter or letter combina- 
SE Or an item that does not include a vowel, or an item that falls into some 
emantic class. Such procedures are verbal cases of experiments on concept forma- 
tion (cf. Figure 10-2). 
a In still another type of arrange 
its context. Consider the following two ver 
Orrect item is indicated by an asterisk: 


dure we will emphasize, the 


e item in each pair occasions a 


ment, whether an item is correct is determined 
bal discrimination lists, in which each 


DESK-TABLE WOOD-GLASS 
* * 


COUCH-BUREAU SEOON-STEEL 
ফু 


CHAIR-DRESSER RUBBER-KNIFE 
FY * 
CHAIR-DESK CEMENT-NAPKIN 
# ফু 
TABLE-CABINET FORK-GLASS 
* FY 


In the first list, the designation of the correct item is arbi | 
appear in two pairs. DESK is correct in both the first and কাৰ i 
is correct in neither the first nor the fifth pair. CHAIR, however rte শপ 
third but not in the fourth pair. The learning of this list, therefo. DB TNere HG 
tional discrimination with respect to this item: CHAIR is cor ct when paired with 
DRESSER but not when paired with DESK. In the second li RS pte) 
ist, GLASS is repeated; 
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2 d upon 
ha hota on is base 
it is correct in the first pair but not in the fifth. The HUA the incorrect 
Semantic classes: the correct items are all Parts of a table setting, 


e 
i class, th 
of either 

items are common materials. Because GLASS can be a member 

learner’s Iesponse mus 


t 
f correc 
Made from each of the following three Sond Shepard, 
and incorrect items (correct items Are marked by asterisks: cf. Chang 

1964; Wickelgren, 1969): 


HIT *FIT *FIT 
*FIN *FIN *FAN 
*HAT *PAT +*PIN 
*FAN *PAN +*PAT 
PIT FAT FIN 
PIN FAN FAT 
PAT PIT PIT 
PAN PIN PAN 


t items 
From each set, we could make up a list Of 32 pairs (each of the 4 correc 
Paired with each Of the 4 incor i 


-) and 
Ons appear Only in Correct items (FL, PA) 
items (FA., PL | 


» through a Pr 
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DAISY MAE - *LT'L ABNER 
BD *ROBERT - MAY 
BLY ANNA - *BOB 
0-8 *BOY- GIRL 
5 +B *DANNY BOY - BETTY ANN 
JB. EL *BRUCE- BETSY 
AB. VO RONA - *WILLIAM 
ABEL LE DOROTHY - *GEORGE 
RENE - +BEN *KENNETH - ANTONIA 
Bin - CON *JOHN- ELIZABETH 
*BILL - CONNIE 


i tter B. At the 
Early in the list, the discrimination is based DEON ae, LR 
transition from letters to words, the semantic e: ee as india 
duced, with the male item always paired with the letter a Laon 3 
Ins, and ni eubsegUent ৰ ্‌্‌ tonite final pair also includes 
Controlled solely by gender. The gender SCE a tHE ELAS 
2 Teversal of the letter discrimination established at the Ee see dimension 
lassroom demonstration, the fading from the letter 


e pairs have been pre- 
Usually proceeds without errors after the first two or three p 


i j written items may be 
sented. In a more practical setting, the fading from pictures to 


ESR j d Rozin, 1973). 
an effective method for teaching early reading skills (Gleitman an 


Verbal recognition: A special 
case of verbal discrimination 


é ‘ be based upon which item 

The discrimination between items in 8 list may He EATON 
appeared 0 Ltn list. When this is the case, NR OLEAN: 
as verbal recognition. For example, only one item i all: CHIMES-WHEAT 
Appeared i Lash item list used earlier to illustrate free ন Teale 
WIGWAM-TERMINAL FUR-CORN, TICE ee Hecognize those items. 
ESS RS ne ao it, or underlining it; in a less 

© response might be pointing a ed i is i e th 

formal Stn ihe NOR might be saying, | EB) tHe el 
Appeared on the list.” This response is a tact contro e DLA Ens 
item but also by the circumstances in which that item VE ER SR 

The phenomenon of recognition is not RS familiar or anifaniilias are ex- 
tions based on whether people or places or objects ন 1967) EEE 
amples of nonverbal recognition. One study (Shepar ih WLS Re a Tat 5 Ne 
Of words, sentences, and pictures. The procedure wit যং is equa y 
divided between frequent words (e.g. child, SIPC) Bp 5 eas (e.g. Julep, 
Wwattled). Immediately after inspecting a S540 Word IN6:0 URE SOLTEOT tent 
fied 88% of the original words in test pairs made up of words from the list and new 
Words. Correct identifications were more likely for rare words than for frequent 
Words. The procedure with sentences used a list of 612 sentences (e.8., “A dead dog 
is no use for hunting ducks”). Immediately after inspecting the sentences, Observers 
correctly identified 89% of the original sentences in the test pairs made up of 
original and new sentences. In the procedure with pictures, recognition Was ex- 
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= 2 pictures. 
:। eries of 612 yk 
imes after observers had inspected ue from the series an 
AIOIEE BE CCDC EM ers shown pairs made up of apie tures. Correct identifica- 
After two hours, olen 99.7% of the Eke ন 5 58% after 120 days 
i ec fter 7 days, it a con- 
a new picture corr 3 days, to 87% a t permit a 
after YS, oes not p 2 
EON EUG ৯-৯ in this task). The study, oye ন stimuli relative to 
(50% is sec bE of recognition for ng" oe Secon aor oho 
i the sup - was discr' FOrGs 
clusion about the irs of pictures, the observer seen be 
5 ! alrs of pic bd দি bly never 
EE be a the original series) and pictures i orminadns between 
aa see ae frequent ones), the observer was dis 
with words (espe 


e of which 
WO Ww. i Ut only oni 

both of which had been seen before In many contexts b 

TAS both 0 a i \/ 

had appeared on the Original list. 


isfield an! 
Ms semantically related to items on the list (cf. Anisfi 
1968; Underwood and Freu 
Recognition may also 
are presented i 


Knapp, 


semi 
ld ICT 
Rt utano t). 
Old or the new item as new. B Je-choice i nl 
be presented together with Several new items (e.g., as in a multip items. 
UE toy Studies, the ney items are sometimes called Fe items i 
ON Accuracy CCreases a, ঠ distrac 
i S the proportion Of new or di 
Kintsch, 1968. cka, 1 Prop 
Correct j E 


1 f a 
sy 0) 
iY 
he probabil abi 
7). Tf the. observer guesses, tl 
eCreases wit 
Trect guess js 50% 
i 


To) ch 
the P 1 en 
h added distractor items ue 25% whe gyots 
en items are Presented in pairs a yr guessing: a ing ud 
is too smal is ee at fos eS DO items); the adjustment fo 


accomp® 
S$ € decrease in recognition accuracy 
Tractor items, 


ES 
ane 
HIE 8 
in the perfol got 
ন Cen assumeg that tecognition is involved in 
ET v 
Ot recogn; an ite ning 
Tem wi 


8? AU 
lear yu 
Je, when 2 t stim 61; 
Procedures. For example, tha L) 
Imulus item fi 
atin, 196 ; Tot occ 


e255 


n dist 


tom a paired-associates ‘pombsch pots 
1 the Appropriate response item ie as ge Tr 
items from therefore been SVgpested that recognition of i s lear ame 
touble js Necessa; condition for paired-associate he 
May so, eti enc: 
litem C 


খি irection. 
‘0 Work in the opposite dire 


ist only e 
tesponse jy. © 2 Stimulus item from the list ceo Ik 
টু erbal OO Se item resen a4 
ite Whereas free ror Tequires discriminate responding in the P ight mo 
these two equine, k 2 i SG 35 
is item i sen Je5°? os 
accounts fe f behavior icing the item in its রঃ Neverthe Fe oso 
related 0 erbal lamin, he Ve different properti - d sec pn 
‘Te acco Ang & assumed that recognition an 
Stages: first the leame; erson 
Een 


occurs Ll ho 
hat recall only 
ower, 1972) suggests tha S 
Crates item 


duce 
5, and then the learner pro 
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recogni 
ee from the list. Colloquially, we might say that the leamer must 
Or a first, and then must decide whether it was on the list. 

diggs foxes ees from such an account is that recognition accuracy ought 
licen যত ন or exceed recall accuracy. In many studies, recognition is superior 
Eth Dn the effect of guessing is taken into account (e.g., in a recognition 
Sal of ক new and old items, guesses will be correct half the time, but in re- 
These: do ey 0 CVC trigrams, the probability of correct guesses will be small). 
Superior to circumstances, however, in which recall accuracy is consistently 
most পাল accuracy (Tulving, 1974; Watkins and Tulving, 1975). The 
Ble lage Cases are with cued recall rather than free recall; in this context, 
help the ্্ং ates learning is regarded asa type of recall in which the stimulus items 
Ene Ee to recall the response item of the list. For example, in one arrange- 
leamey oR ter was first given a paired-associates list (e.g., glue-CHAIR). Next, the 
fiom the free-association stimulus items likely to occasion response items 
CHAIR, GLO OLS list (e.g., table, for which typical assocates might be 
ciation task Fi DESK, DINNER). Among the words produced in this free-asso- 
earlier a e leamer was then asked to identify those that had appeared in the 
Daired.associ “associates list (recognition task). Finally, the stimulus items of the 
recorded ( iates list were presented (e.g., glue- 2), and the learner's responses were 
nized Ee eEN task). In procedures of this type, learners typically recog- 
recall task r words from the original list than they were able to recall in the cued- 
ke Outcome not only makes untenab 
OW sim it is also likely to surprise t 

pler than recall. The important point is 


be Ee 
responsi P OPTiate in the first place. Recognitio' 
i S controlled by different classes of stim 


iffer 
en ্ é 
t ways, and therefore their comparison ma: 


Je the model of recognition as a com- 
hose who regard recognition as some- 
perhaps that the comparison may 
n and recall are different classes of 
uli; these classes are measured in 


y not ‘be justified. 


Pone 


Summary 


loti have examined the major classes of verbal-learming procedures: serial 
Shei Paired-associates learning, free recall, verbal discrimination, and, as a 
Case of verbal discrimination, verbal recognition. In these instances of verbal 


earn; 
li ! ef b 
tions 8, performances are typically established and maintained through instruc- 

j mposed on correct and in- 


Nolet re than through differential contingencies i L - । 
Covereq Ene Although each procedure seems superficially simple, We dis- 
at the human learner, while mastering any of these tasks, inevitably does 
More besides (in paired-associates learning, for example, the learner’s be- 
May come to depend on relations among stimulus items or response items 
hen Bent pairs). We also concluded that it is inappropriate to regard verbal 
as other Or associations simply as stimulus-response units; human verbal behavior 

Structural properties more important than the ordering of events in time 


8., ৰি g 
fora tering in free recall). Now we tum to transfer of learning: once a per- 
Other PE has been established with one procedure, we may ask how the mastery of 


TOcedures will be affected. 


AVior 
Across 


TRANSFER 


Can the discipline of mathematics make one a more logical thinker? Will the 
mastery of a classical language like Greek or Latin make it easier to learn economics 
Or history or sociology? Does the study of music or art establish skills that will be 
useful in studying great works of literature? Research on transfer of learning or 
transfer of training actually began with simpler questions than these (Thorndike 
and Woodworth, 1901). Some of the Justification for such research was a common 
assumption: that the traditional educational curriculum was appropriate not simply 
because each subject matter might be important to the student in the future, but 
also because the disciplines “exercised the mind” OF, in other words, taught various 


ng one task could affect learning 
‘W. Yet each demonstration of the 
asks or of some other limitation on 
upported it. And the typical college 
uence of required courses to electives, 
Patterns of course options. 
role in this history (Joncich, 1968). 
instrumental learning in animals to 
ngs; this change was paralleled by his 
ঠি ers such as course requirements for 
potential teachers. His findings in various studies of transfer were not only contri- 
i £ Pi they influenced faculty committees respon- 
sible for establishing and reviewing cours 
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ng and Transf 
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Cn relative to learning Task Y alone. When learning Task Y after Task X is 
Tale ¥ LE to learning Task Y alone, transfer is said to be zero (zero transfer: 
With over not affect Task Y). When learning Task Y after Task X is more rapid or 
Huns et) than learning Task Y alone, transfer is said to be positive (positive 
ler Ce AS fe made Task Y easier to learn). When learning Task Y after Task X is 
veil wit more errors than learning Task Y alone, transfer is said to be nega- 
egative transfer: Task X made Task Y harder to learn). 
TAT history of research on transfer includes many types of studies, both 
Woodwo EE (e.g., transfer of motor skills from one hand to the other: cf. 
ihecific po 1938, chapter 8). Demonstration of positive or negative transfer in 
SUDA চল permitted only a cataloguing of particular transfer effects. Paired- 
Might a S learning suggested an analysis from which general principles of transfer 
Chgnes tn Fo studies of transfer could be based on changes in stimulus items, 
tees response items, or changes in their relations (Bruce, 1933). Table 12-2 
list on io several procedures in paired-associates transfer. The effect of learning one 
changed arning a second list can be examined when only the stimulus items are 
Whe » When only the response items are changed, when both are changed, or 
n both remain the same but their pairing is changed. 


TABLE 12-2 


ciates lists. In each hyphenated pair of 
lus items and the second represents the 
llable items are also shown for each 


So 

aes transfer procedures with paired-asso 

list 9 the first represents the list of stimu 
response items. Sample CVC nonsense-sy 


Procedure. 
Pr Designation Sample items 
cedure Listl © List2 Listl © List2 
Sti 
Sn change only A-B C-B lan-qip  fis-qip 
gS change only A-B A-C req-kiv  req-zam 
js mulus and response change A-B cD xal-pom cam-lup 
"Ame lists, stimulus and response A-B A-Br hab-lef  hab-mot 


ite j 
Ms re-paired guv-mot  guv-lef 


if dhe Eg only the stimulus items may produce positive transfer, especially 
a - original list included unfamiliar response items (ct. response integration in 
heh ROGUES learning); changing only the stimulus items, on the other hand, is 
econ i produce negative transfer, presumably because during the learning of the 
সম ist the stimulus items continue to occasion response items from the first list 
L arnes and Underwood, 1959). Ideally, changing both the stimulus items and 
lr items should produce no transfer if the relations among items is arbi- 
is (on practice, the procedure assesses the effect of learning successive different 
last earning set, sometimes also called generalized or nonspecific transfer: cf. 

Pter 7). The re-pairing of unchanged lists of items is one of several other varia- 
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tions on paired-associates transfer procedures. For example, the symmetry of asso- 
ciations can be studied by exchanging stumulus and response items (transfer from 
an A-B list to a B-A list), and the role of particular properties of items can be 
studied by changing lists in various systemic ways (e.g., given A and A’ as CVC non- 
sense-syllable lists differing only in item vowels, transfer from an A-B list to an 
A'-B list, or given B and B' as word lists in which corresponding items in each list 
are synonyms, transfer from an A-B list to an A4-B' list). 

Whether transfer from one Paired-associates list to another is positive or 
negative depends on the relations among the items in the two lists. The relations 
have been summarized on the basis of the similarity between corresponding first-list 
and second-list items (Osgood, 1949). Because similarity can vary separately for 
stimulus items and for response items, the effects are represented as a transfer sur- 
face, illustrated in Figure 12-5. Positive transfer is maximal when the stimulus items 


e 
£ 
6 

ey 

+ 

EE’ 0 

5 

£ 

= 


Negative 


urface (after Osgood, 1949). The plane, abdc, represents 


i hl items, Dimensi imilarity 
of stimulus items, and dimension bd repre. 3 Sion ab represents similar 


i 1 (as at c, for A-B to A-C transfer), and 
Points on the plane Tepresent zero transf ; i B 
transfer, to d, for A-B to C-D transfer). sfer (as along the line from b, for A-B to C 
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on eet items are identical in the two lists (4-B to A-B transfer); negative 
Sone Es when unchanged stimulus items must occasion second-list re- 
Tn UTS 5 f tat are incompatible with the first-list response items (A-B to A-C 
Ase oer in A-B to A-Br procedures, may also produce negative transfer). 
gist ond-list stimulus items differ more from the first list stimulus items, 
r approaches zero (A-B to C-B transfer, or A-B to CD transfer). 
fi he ey significance of the transfer surface, however, depends largely 
TEDL oe of similarity. Consider the learner who has learned LOUD as the 
tute for Lor particular stimulus item. New lists might be constructed that substi- 
(0 OISY) UD response items related in spelling (CLOUD), related as synonyms 
teen L or related as antonyms (SOFT). How are we to judge the similarity be- 
ings but fs and List-2 response items in these Cases? Are words with similar spell- 
but sini ifferent meanings more or less alike than words with different spellings 
ARG meanings? Are antonyms antagonistic response items, or should both 
examples en synonyms be treated as semantically similar? Should we call these 
speakin of A-B to A-C transfer, or are the relations sufficiently close to justify 
£ Of A-B to A-B' transfer? 
e in each case. The learner, having 
ulus item in the first list, would 


Toad; 

ans learn CLOUD in response to the same stimulus item in a second list. But 
ansfer from LOUD to NOISY or to SOFT would also occur quickly. Transfer 

jon than with the spelling or synonym 

that antonyms are semantically more 

to argue that similarity is better mea- 

ling or other properties. But sooner or 


1 
ln would encounter problems. For example, spelling is likely to be a more 
thous dimension of verbal items in some transfer procedures than in others. 
ably hea the transfer surface summarizes some features of transfer data, it is prob- 
of Stina EE as part of the definition of similarity. Similarity is not a property 
ollows i; rather it is derived from organisms’ behavior with respect to stimuli. It 
ano that doing the experiment remains the most appropriate way to judge the 
nt and direction of transfer from one task to another. 
An understanding of transfer effects may be critical to the sequencing of 


as ৰ & 
EE in instruction. For example, transfer may help to establish some formal rela- 
$ i imuli and responses (chapter 10). Consider 


th 
bn i i child learns the alphabet: 


»— — spoken “a” 


echoic Sspoken “a 

dictation Sspoken a” ———____Rviitten A 
transcription Swritten A ———Rviitten A 
textual S Written A —— — — spoken "a" 


d long before a child learns to read and 
r might be regarded as an instance of 
d that to dictation might be regarded 


Oi 
nari | | 
Write I echoic behavior is establishe 
4B to hus, the transition to textual behavio 

CB transfer (stimulus change only), an 
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as an instance of A-B to A-C transfer (response change only). We might HE Ee 
transfer in such cases would not necessarily be predictable from the st) St 
face. For example, if a child had never learned to write letters, we would not e i 
much transfer from echoic behavior to dictation. If transcription had been es ক 
lished, however, we might expect that transfer to textual behavior would be ea: 4 
Thus, the order in which the formal classes are established may be a critical aspe 


of the teaching of reading and writing; determining the most effective order is a 
problem of transfer. 


Proaction and retroaction 


We have considered cases in which leaning List 1 affects learning List 2. This 
is called a proactive effect, because the direction of effect is from an earlier to a 
later task. But learning List 2 may also affect subsequent List-1 performance. For 
example, after transfer from an A-B list to an A-C list, the learner who earlier had 


been able to give B response items to A stimulus items no longer may be able to do 
so (e.g., Barnes and Underwood, 1959 


Sponse items. This is called a retro. 
from a later to an earlier task. Mos 
Cases in which the learning of one 
mance on the other lists (Miiller al 
active and retroactive inhibition, 0 

As with other instances of t 
comparing performance on a Suc 
alone. The design of proaction stu 


); the A stimulus items now occasion C Ie 
active effect, because the direction of effect is 
t research on these effects has concentrated on 
list interferes with rather than enhances perfor- 
nd Pilzecker, 1900); such effects are called pro- 
r proactive and retroactive interference. 

ransfer, proaction and retroaction are studied by 


cession of tasks with performance on one task 
dies can be represented as follows: 


Experimental Group 


Learn Task 1 
Control Group 


Learn Task 2 Test Task 2 
Learn Task 2 Test Task 2 


The design of retroaction studies can be tepresented as follows: 


Experimental Group 


Learn Task 1 
Control Group 


Learn Task 2 Test Task 1 
Learn Task 1 


ক Test Task 1 
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dently in the two separate procedures. The procedures allow the study of proactive 
and retroactive effects among a variety of tasks (e.g., serial learning as Task 1 and 
Paried-associates learning as Task 2, or French vocabulary words as Task 1 and 
Spanish vocabulary words as Task 2). In a broader context, what is learned in one 
course may be an important determinant of what is learned in a later course (pro- 
action, as when preparation in algebra affects the mastery of calculus), but, con- 
versely, what was learned in the earlier course may also be influenced by a later 
Course (retroaction, as when the treatment of molecular bonding in an advanced 
chemistry course affects the terminology learned in introductory chemistry). 


Extensions to nonverbal procedures 


Instances of learning do not occur in isolation. Other learning preceded, and 
other learning is likely to follow. To the extent that every instance of learning may 
be affected by what came before and what follows, proaction and retroaction, and 
transfer in general, are relevant to all learning. A child’s learning of the alphabet 
may be affected by earlier nonverbal learning of discriminations among geometrical 
Patterns (e.g., Gibson, 1965). Upper-case letters, unlike lower-case letters, do not 
include any up-down or left-right reversals (e.g., p versus b and b versus d), and thus 
transfer from the upper-case to the lower-case alphabet may depend on whether the 
child has already learned nonverbal up-down and left-right discriminations. The 
Verbal learning in reading a laboratory manual may affect the ease with which the 
Student later masters various nonverbal laboratory skills. These examples are ex- 
ceedingly complex. We have seen that even in simpler cases, such as paired-associates 
learning, the study of transfer requires an experimental analysis of various com- 
Ponents of performance, such as association and response integration. b 

The simplest cases of transfer are those with a shift only from one discrimina- 
tion to another. For example, the reinforcement of a pigeon’s pecks in the presence 
Of green but not red might be followed later by reinforcement in the presence of 
blue but not yellow or of circles but not squares. We might expect more rapid trans- 
fer from one color discrimination to another than from a color discrimination to a 
form discrimination. Similarly, reinforcement of a rat’s jumps toward a left but not 
2 Tight stimulus might be followed later by reinforcement of jumps toward a light 

ut not a dark stimulus, or vice versa. We might find that transfer from position to 
brightness proceeded differently than transfer from brightness to position (the 
ailure of transfer in such procedures has sometimes been called fixation: N.R.F. 
Maier, Glaser, and Klee, 1940). 
Discrimination reversal may also be regarded as a type of transfer procedure. 
Or example, the reinforcement of a rat’s lever presses in the presence but not the 
absence of tone might be followed by reinforcement in the absence but not the 
Presence of tone. The reversal of performance usually is more rapid if the stimuli 
Are switched after many sessions of the original discrimination than if they are 
Switched after only a few sessions of the original discrimination (the overlearning 
tl effect or ORE: e.g., Capaldi and Stevenson, 1957; D’Amato and Jagoda, 
961; Reid, 1953). 
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Related phenomena are called transposition and reversal (e.g., Honig, 1962). 
For example, suppose a rat learns to respond to the larger of two squares in a dis- 
crimination apparatus. If the training squares are 2 and 4 centimeters in width and 
a new pair with widths of 4 and 8 centimeters is presented, the rat is likely to 
respond to the new larger square rather than to the now smaller square that it had 
responded toin the past. This transposition based on the relation between the 
stimuli will occur only within certain absolute limits; if much larger sizes are pre- 
sented (e.g., 16 versus 32 centimeter squares), the rat may show reversal, i.e., re- 
sponding to the smaller square, closer in absolute size to the one it had responded 
to in training. A theoretical account of transposition and reversal has been offered 
in terms of excitatory and inhibitory generalization gradients (Spence, 1937: cf. 
chapter 7). 

Some cases of transfer change discriminations within the original stimulus 
dimensions (e.g., from one color discrimination to another), and are called intra- 
dimensional shifts. Reversals are one type of intradimensional shift. Other cases 
involve a change from one stimulus dimension to another (e.g., from color dis- 
crimination to form discrimination) and are called extradimensional shifts. The 
effects of stimulus dimensions in discriminative transfer have been assessed by 
comparing intradimensional reversal shifts with extradimensional nonreversal shifts 
(Kendler and Kendler, 1962). These are illustrated by the following two cases; each 


task is represented by sample pairs of verbal-discrimination items in which the first 
item of each pair is designated as correct: 


Reversal shift Nonreversal shift 
Task 1 Task 2 Task 1 Task 2 
XON-map MAP-xon XON-map MAP-xon 
nij-TOY toy-NIJ nij-TOY NIJ-toy 


in each list, the items are CVC trigrams, but the item pairs differ along two dimen- 
Sions: nonsense syllable versus common word, and upper-case versus lower-case. In 


Task 1 in both cases, the correct item i i A 
is the nonsense syll in upper 
Case or lower-case. In Task 2 afte een hae I 


1 the reversal shift, the correct item is now the 
common word; whether the item is upper-case or lower-case remains irrelevant. In 
Task 2 after the nonreversal shift 


> 


however, the correct i i i CI- 
is ect item is the one in upp 
Case; the nonsense-syllable versus w ত 
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fere with learning other tasks (positive and negative transfer), and it may affect the 
performance on tasks learned earlier or later (retroaction and proaction). Whether 
transfer from one task to another is positive or negative depends upon the detailed 
relations of the stimuli and responses in the two tasks. The transfer surface appears 
to summarize these relations in terms of the similarity of stimuli and responses in 
the two tasks, but conclusions drawn from the transfer surface are made ambiguous 
by the nature of the similarity dimension; similarity is not a physical property of 
Particular stimuli, but is derived from the properties of behavior in the presence of 
these stimuli. We have considered mainly examples of transfer from human a 
learning, but in closing we noted that the role of transfer is a চং in 
ing, and we briefly considered some nonverbal examples of transfer. It is এহ 
to make informed estimates of transfer in both verbal and nonverbal cases, but i 

data are unavailable the best way to find out about the dimension and magnitude 
Of transfer from one task to another is to do the experiment. 


embering 


THIRTEEN 


Memory has its source in an Indo-European root smer- or mer-, to remember, 
through which it is related to remember and mourn. The root does not seem 
to be linked to the Indo-European men-, to think, which is the source of 
mnemonic, amnesia, memento, reminiscence, automatic and, perhaps most 
interesting, mind and mental. Forget has a source in the Indo-European root 
ghend-, to seize or take. Through the Middle English gessen, to try to get, it is 
related to guess, and through the Latin prehendere, to hold before, it is re- 


lated to apprehend and comprehend. 


The term memory is the popular name for the topics treated in this chapter, but the 
chapter might as easily have been entitled “Remembering.” The rationale for pre- 
Iring one usage to the other has been discussed by Woodworth (1921, Pp. 5-6): 

“remembering”; instead of “thought” we 
Should say “thinking,” instead of “sensation” we should say “‘seeing, hearing,” 


etc. But, like other learned branches, psychology is prone to transform its 
verbs i hat happens? We forget that our nouns are merely 
STUN r the things denoted by the nouns; but 


substitutes for verbs, and go hunting fo aE t 

there are no such things, there are on the: ach es that we started with, 

Seeing, remembering, and SO On... It is a safe rule, then, on encountering 
2 3 


any menacing psychological noun, to strip off its linguistic mask and see what 
manner of activity lies behind. 


Instead of “memory,” we should say 


with ways in which an organism's re- 


The study of memory is concerned ৰ } 
vent or events in the past. A stimulus 


5 t 

UB now may be occasioned by some € Sn 
Ay set the occasion for responding even when a delay is imposed between the 

Stimulus and the opportunity for a response. We will examine such situations, 

mphasiz; i hile the stimulus is present, during the time 
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i i hether 
Organism does immediately upon presentation of the ক can a re ORG 
i i in other words, whether the appropri 
the simulus is remembered (or, in nese 
ften speak of what the organi. 

CCurs later). Accounts of memory 0 i rene 
sfiinalus is কাক as memory storage, and of what the i CEs Tea 
response is later occasioned as retrieval from memory. Systematic rela ন OE 
stimuli to be remembered and subsequent responding occasioned by ol দন 
are often discussed as encoding; for example, the learner Who recites wor 


i j in 
as they are presented Visually is said to be encoding the visually presented words 
an acoustic (vocal) modality. 


ant events can occur during any of the three 


Stages of a memory procedure: Stimulus presentation, imposed delay, and oppor- 


with the response appropriate 
memory in terms of interference eavily on empirical studies of transfer, 
and grow out of the analysis of negative transfer in studies of Proaction and retro- 
action (cf. chapter 12). Finally, 


temembering may be affected by the conditions 
under which the learner is given an opportunit 


to those in which the sti 
will 


are based h 
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Memory is a complex subject matter with an extensive history (e.g., Murray, 
1976). We discussed some of that history in the treatment of verbal leaming 
(chapter 12), and will consider it further only in passing. We begin instead with two 
Contrasting examples of memory. First we examine the language of memory in the 
Context of relatively simple memory experiments with pigeons. Then we illustrate 
the complexity of human memory in the context of the memory strategies called 
Mnemonic systems. These treatments set the stage for surveying various research on 
human memory. 


THE NATURE OF MEMORY 


In our analysis of verbal behavior (chapter 10), tacts were defined as verbal re- 
Sponses occasioned by nonverbal stimuli. The presence of the stimulus at the time 
Of the verbal response was a constraint implicit in the definition of the tact. We 
May then ask how we should deal with verbal responses that depend on stimuli 
No longer present. For example, a witness to an accident may later describe the 
events at the scene, or may name the people who were present. These are the cir- 
cumstances in which we might speak of the witness’s memory or remembering (this 
!S also the behavior called recall in chapter 12). But how much difference is there 
tween tacting the events while they are occurring and describing them at a later 
time? The distinction is in the kinds of behavior that may occur between the event 
andits later recall. Such behavior may influence what is remembered. 

To illustrate these points, let us return again to the pigeon. Our rudimentary 
example of tacting in the pigeon used a chamber in which pecks on a left or middle 
Or Tight Key were respectively reinforced during red or blue or green light in a 
Window above the keys. In that example, the pigeon’s pecks occurred in the pres- 
ence of the lights. Suppose now that each light is presented only briefly, and the 

Cys on which the pigeon pecks become available only after each Presentation ends. 
the red light appears for only one second and then the pigeon immediately pecks 
the left key, it would still be appropriate to say that the left-key peck is under the 
Control of the red stimulus. We might still wish to call the left-key peck a tact of 
€ brief red stimulus, even though that stimulus is gone when the peck Occurs. 
Uppose we next impose a five-second delay between the brief presentation of red 
And the Pigeon’s opportunity to peck, and the pigeon continues to peck the red 
key, The left-key peck still is under the control of the red stimulus. We might be 
Concerneq with how the pigeon is able to respond consistently to the different 
Colors after various delays, but we do not yet have a reason for assuming that the 
ehayigy is different in kind from that when the pecks occur in the presence of the 
a Ors, Imposing a delay between a stimulus and an opportunity for the response 
hat it occasions does not necessarily alter the control of the response by that 
Stim, 

In সপ s by the color is likely to diminish even 
With 5 Teor a aE plait po ee will probably occur at chance 
evels ata delay of five seconds (cf. Blough, 1959). Yet we may be able to teach the 
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pigeon to remember. To do this, we might shape different performances after 5 
and blue and green, so that each performance would later occasion a peck on ৰ 
different key (e.g., pecking the left chamber wall after presentations of red, 5 
pecking the left key if pecking the left chamber wall when the keys becom 

available). If each color occasioned pecking in a different location, the Hr 
over which the pigeon could remember a color would depend only on how long i 

could maintain its pecking in that location during the imposed delay. Under such 
conditions, control by the colors could extend over delays of many seconds and 
perhaps even minutes. By interrupting the behavior during the delay, we could then 
show how the pigeon’s remembering depends on that behavior (Jans, 1976). For 
example, if we operated the feeder briefly between the stimulus presentation and 
the later availability of the keys, the Pigeon would move to the feeder and eat, but 
it might not then return to pecking at the same location as before the feeder opera- 


tion. In this instance, the interpolated feeder operation might reduce control of the 
key pecking by the colors to chance levels eve! 
seconds. 


It is tempting to Say that we cannot 


n at delays of only one or two 


really call the pigeon’s performance re- 
membering if the temporal gap between stimulus and response is bridged by con- 


tinued and localized behavior. Somehow, it seems appropriate to say that the 
Pigeon’s key pecks are more controlled by its previous behavior than by the colors 
as stimuli. Yet we modify our remembering by keeping calendars and appointment 
books. Consider the Person who looks up a number in a telephone book and then 
begins reciting the number repeatedly until able to dial it. If the dialing is success- 


ay that the number was remembered, even 
though the dialing might have been mo 


Zangwill, 1972, p. 130). These examples 
illustrate that questions about what m 


emory is can often be translated into ques- 
3 
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One well-established mnemonic technique is the method of loci (places). It is 
attributed to the Greek poet Simonides (Yates, 1966). Simonides is said to have left 
a banquet hall just before the roof collapsed and killed all the occupants. Although 
the bodies of the guests were not recognizable, Simonides was able to identify them 
for their relatives by where they had been sitting at the banquet table. This demon- 
stration of an orderly spatial arrangement contributing to accurate remembering is 
supposed to have led Simonides to inventing the method of loci. The method be- 
Came the basis for remembering the sequence of topics in speeches, and was 
described by the Greek orator, Quintilian: 


es in memory, he says, a building is to be 
d a one as possible, the forecourt, the living 
t omitting Statues and other ornaments 
he images by which the speech is to be 
ation on the places which have been 


In order to form a series of plac 
remembered, as spacious and varie 
room, bedrooms, and parlours, no 
with which the rooms are decorated. Th 
tTemembered ... hen placed in imagin 
EMG Eine This done, as soon as the memory of the facts re- 
quires to be revived, all these places are visited in turn and the Various de- 
posits demanded of their custodians. We have to think of the SEA 
as moving in imagination through his memory building bos 2 is nn I his 
Speech, drawing from the memorised places the images ie ey 2 So 
them. The method insures that the points are remembered AE oS er, 
Since the order is fixed by the sequence of places in the building. (Yates, 


1966, p. 3) 
ea systematic walk from one distinctive 


us landmark to another: the dormitories to the dining hall, and next to the 
Y, and next to the computer center, and next to the EE and so on. 
0 learn some ordered series, the student then imagines each item at each successive 
Ocation. To recall the series, the student would take the imaginary walk again, 
Temembering each item in its appropriate place. To learn anew sequence ata later 
time, the student repeats the imaginary walk, in the same order, this time visualiz- 
Ing the new items in their successive places. The new series will be learned with 


relatively lite interference from the first series (Christen a EE er 2 
€ first series m longer be as well yorneniberede Th METI OEY 
Ae mbered only temporarily (e.g., a shopping 


mai 
ES for series that need to be reme 


For example, a student might imagin 
camp 


libran 


series of concrete items that are 
ms Or combinations may have an 
tant is the spatial closeness or 


These mnemonic techniques work best with 
bizarre ite 


Visualized or imagined. Unusual or j 
more impor 


tage over i but even i i 
common items, bu i i airs of ob 
ction of the items and places. For example, in learning pai jects, 


if the objects are presented in some 
TS rememb irings more accurately i i iy j 
On to a (pe TS on top of the other) than if they are Sinnply pre 
d side by side (Wollen, Weber, and eT Greek and Roman times, 
Various i ices were deve!o Se i 
1 In the Nine STOIC ethods for remembering religious doctrine (Yates, 
966 Th e Ages became mM is for ritualized forms of religious art, as 
Whep ~ ey gradually became the basis 


i irtues. In the 
i j t the vices and the vir 
esen 
course or ss OEE AE 0 Origins of these art forms were gradually 


Of this evolution, the mnemoni 


easily 

Van 
Conne 
Carn 
telat 
Sente 
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(and ironically) forgotten. Mnemonic techniques were given relatively little ti 
tion throughout much of the history of psychology, and major interest ATDTO). 
significance for analyzing memory has developed only recently (e.£., oe Gino 

Some techniques are designed so that abstract items can be converte 
concrete ones. The peg-word technique is a simple method for translating num 0 
into a sequence that can be visualized, as in “one is a bun, two is a shoe, three A 
tree, etc.” An ordered list can then be learned by imagining each item together wi 2 
the object Corresponding to its numerical position in the list. These systems hav 
the advantage over the method of loci that the learner can recall the item in মি 
position without necessarily starting from the beginning of the list Ce.s., to rec 
the third item, the learner has only to remember what was imagined with tree). 

A more elaborate system provides a code for translating numbers into letters. 


This system has been a part of popular mmemonic techniques for roughly a century 
(e.g., Loisette, 1899). One version is the following: 


Number Consonants Rationale 
1 t,d t has one downstroke. 
Zz n n has two downstrokes. 
F- m ™m has three downstrokes. 
4 f r is the fourth letter of four. 
5 1 lis the Roman numeral 50. 
6 soft g,j Script g is an upside-down 6. 
7 &, hard c k can be combined with Rie He 
8 hv both 8 and script f have two loops. 
f D,b backwards p or upside-down b is 9. 
Zz, 


2 is the first letter of zero. 


t include an apartment building, a play- 
round, a school, a Pharmacy, 2 parking lot, and so on. Next, construct an arbitrary 
Sequence of twelve two-digit numbers (e.g., by consulting a random number tabl$) 
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as, for example: 66, 57, 28, 14, 11, 87, etc. From the code presented above, each 
Of these numbers can now be converted into a word. Each word should include two 
and only two consonant sounds; unsounded letters can be ignored. For example, 66 
Can be coded as “judge,” since the “d” is silent. Concrete words are preferable to 
abstract ones (e.g., for 57, “lock” is preferable to “ike”). Finally, each word, as 
derived, should be the basis for imagining an object or event at the successive loca- 
tions (e.g., a giant judge sitting on the roof of the apartment building, a padlock 
attached to the playground swings, etc.). At some later time, the successive loca- 
tions can be used to recall the words, and the words can be translated back to their 
respective numbers. On the first attempt with this system, it may be helpful to 
have the letter-to-number code on hand, but with some practice most users will find 
that an arbitrary number sequence can be learned with relatively little effort. Re- 
membering fifty or more numbers is not a difficult task for a skilled user of 
Mnemonic techniques. , { 
Such arbitrary feats of memory might be useful mainly for impressing one’s 
friends, but the main point is that they demonstrate the flexibility and capacity of 
Uman memory. Educational systems have tended to emphasize learning through 
understanding, and have correspondingly de-emphasized or even discouraged 
Memorization. It is unlikely, however, that the learner will be disadvantaged by 
camming in more than one way, and therefore mnemonic techniques can be an 
effective supplement to other methods of study (cf. Higbee, 1977). { 

Mnemonic techniques work, but they are a far cry from the simple example 
Memory in the pigeon discussed earlier. The techniques illustrate how varied 
the phenomena are that must be considered in the study of memory. Even if we 
cannot say why humans are more likely to remember concrete items that have been 
‘magineq than abstract items that cannot easily be visualized, it at least is known 
that What the learner remembers depends on what the learner does. These 
Mnemonic techniques are classes of behavior that can be learned. 


of 


্ঘ 
HE VARIETIES OF MEMORY 


required continued behavior that bridged 

© Bap between stimulus presentation and a response after an imposed delay. The 
Mnemonic examples did not include such uninterrupted intervening behavior. 
Clearly the cases are different. We would similarly distinguish between remembering 


An appoi { ’s calendar and rememberin 
. “Ppointmen it is written down on ones 4 
Wit EE i implies that memory can take different 


T 5 
ie eXample of the pigeon’s remembering 


rieties of memory. We will examine three 
of memory phenomena: the relatively brief persistence the effects of the 
Us; the maintenance of responding occasioned by heme Fa 10 Telearsal; 
’ Ppropriate responding after periods of time without rehearsal. These are often 
mn cribed respectively as iconic memory, short-term memory; and long-term 

mory, and have been the basis for many theories of remembering (e.8., Norman, 


aS a basis for classifying the va 


es 
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1968, 1970; Shiffrin and Atkinson, 1969; Sperling, 1963, 1967; Waugh and 
Norman, 1965). 


Iconic memory: The persisting 
effects of stimuli 


The effects of a stimulus may continue even after the stimulus presentation 
has ended. Persisting aftereffects of visual stimuli are called icons, and the topic of 
iconic memory is concerned with their time courses (for a discussion of the relation 
between icons and visual afterimages, see Sakitt, 1976). How is that time course 
to be measured? If an observer reports some letters that have been presented 
briefly, how is a report based on the sensory aftereffect to be distinguished from 
One based on the observer’s continued rehearsal of the letters? 

Humans are limited in the number of items that can be rehearsed or remem- 


bered after a single brief Presentation. This limit, typically within the range of seven 
Plus or minus two items (G. A. Miller, 195 


Memory. For present purposes 
The range of the span of immedi 


to nine words almost as easily i 


selves include many more than nine letters. 


The problem can be solved by showing more items in the stimulus display 
than can be included within th 


a second, but during this time the observer in effect can still read the items in the 
display even though the display is no longer there. 


This experimental rationale was developed by Sperling (1960); data from one 
experiment are shown in Figure 13-1. A four-by three matrix of letters and 
numbers was presented to Observers for 50 milliseconds (0.05 sec). An example of 
Such a matrix is the following: 


Le | — 1.00 
J Sperling (1960) 
| 
|| N=4 
10 || 
Ir | 
1 .75 
58 gs 
চু £ 
E S 
: 6- 4 50 EE 
| g 
চ S 
5 4 || x 
| 
| =| 3 
| Display | 
|| 
0 lL | 
—.10 0 .15 .30 1.0 | 
Delay of Instruction Tone (Sec) 


Immediate-Memory Span 


Figure 13-1. Characters available from a three-by-four matrix of letters and 
numbers when the signal for a report occurs at various times relative to the display 
of the matrix. The signal was a tone of high, medium, or low frequency that 
instructed the observer to report the characters in the top, middle, or bottom row 
of the matrix. With increasing delays of the instruction tone, the accuracy of the 
Observer’s report approached the immediate-memory span for this task. The data 
Are means across four observers. (Adapted from Sperling, 1960, Figure 7.) 


in Figure 13-1). The mean number of characters correctly reported from the single 
line was multiplied by three, and taken as an estimate of the total reportable 
Characters in the matrix (y-axis in Figure 13-1). t 
About 10 characters from the matrix could be reported when the instruction 
tone preceded the display or occurred as the display ended (delays of -0.10 and 0 
Sec); when the instruction tone followed the display with delays of 0.15 sec or 
longer, the proportion of reportable characters decreased with increasing delay. 
en the instruction tone was delayed by 1.0 sec, the number of reportable charac- 
ters was about equal to the span of immediate memory in this task (this span, of 
about four to five items, is at the extreme low end of the usual five-to-nine item 
Span of immediate memory presumably because of the relative complexity of the 
Observer’s task, including the mix of lettersand numbers and the matrix format). The 
region between the data points and the immediate-memory span (shaded area in 
Figure 13-1) is assumed to represent the persistence of the sensory effects of the 
h the observer can still read particular rows of the 


display; this is the extent to whic 
matrix when given the appropriate instruction tone, even though the matrix is no 


longer present. 
315 
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This experiment demonstrates that the sensory effects of a SR EEE 
for a brief time after the stimulus has been presented. Similar e ects a erGun 
demonstrated with auditory presentations of verbal stimuli, as in Hr es 
which different sequences of spoken letters were simultaneously presente 


i d 
auditory channels (left ear, right ear, and both ears: C. J. Darwin, Turvey, an 
Crowder, 1972). The effects 


Short-term memory: The ro/e 
of rehearsal 


The recall of a human learner immediate! 


about five to nin 
memory (Jacobs, 1887; G. A. Miller 


Y after the presentation of a verbal 


- R. Peterson and Peterson, 1959; Waugh ক 

Norman, 1965: see also Daniels, 1895; Ww. G. Smith, 1895; Murray, 1976). A ক 
irectly under the control of prior stimuli 
Control of other intervening behavior, such as fj ন 
Te way to add i ion is wi Fn 
Prevents the learner from eno: 5 woke SS this question is with a tas 


fs ehavior between the presentation of 
nity for their recall 


Memory 
317 


three co 
begin BOtnG a then a three-digit number. The learner’s instructions were to 
flashed, to recall Le Wards by threes from the number and then, when a signal light 
506; the learner th. he consonants. A trial might start with the spoken items, CHJ 
light, which set qr counted backwards, 506, 503, 500, 497, etc., until the signal 
Presentation of th the occasion for naming the consonants. The time from 
Under these on) consonants to onset of the signal varied from 3 to 18 seconds. 
delays of 15 or 1 oe recall accuracy decreased with increasing delay until, at 
conditions, learne seconds, the proportion correct was less than 10%. In two other 
Were Eloi ৰং rl were allowed periods of vocal or silent rehearsal before they 
Compares data fr ninber from which they were to count backwards. Figure 13-2 
oF ofther sien Ee the original procedure (open circles) and data with 3 seconds 
the backwards re. hearsal (filled triangles) or vocal rehearsal (filled squares) before 
determined দি -counting task. The way the learners responded to the consonants 
by oUF behavior well they were able to recall them. Recall is determined more 
These a respect to past stimuli than by the stimuli themselves. 
of memory: sho related findings led to theories that proposed two distinct types 
son, 1969) 1 rt-term memory and long-term memory (e.g., Shiffrin and Atkin- 
are nitained are said to be rapidly lost from short-term memory unless they 
itself, items j Ror through rehearsal; by some means, perhaps through rehearsal 
long-term n short-term memory are sometimes transferred to a more permanent 
but the SINONY. Only a few items at a time can be held in short-term memory 
capacity of long-term memory is virtually unlimited. Data from OIE 


1.07 
Peterson & Peterson (1959) 
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Figy; 
of the Ee Proportion of three-consonant items correctly recalled as a function 
one can Ee the presentation of the item and the opportunity for its recall. 
count Backs (open circles), rehearsal was prevented by instructing the learner 
sSonant tl wards by threes from a number that immediately followed the three- 
ndsof Limulus item. In two other conditions, the learner was allowed three 
before চি either silent rehearsal (filled triangles) or vocal rehearsal (filled squares) 
(Adapteq y TUDE that initiated the backwards-counting task was presented 
rom L. R. Peterson and Peterson, 1959, Figure 3 and Table 1.) i 
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detailed analyses of performances obtained with short-term memory procedures, 
icated this view. | 
Be ry without allowing rehearsal (Figure 13-2, opened circles) 
at first seem to represent the gradual fading or decay of the items to be 
remembered. But several lines of evidence suggest this is not the case. Consider 
first the effects of varying the number of letters to be remembered, as in Figure 
13-3 (Melton, 1963). A stimulus item containing 1, 2, 3, 4 or 5 consonants and 
then a three-digit number were presented visually; the learner read the consonants 
aloud, and then began counting backwards by threes or fours until a visual signal set 
the occasion for recall. When the item consisted of only a single consonant, recall 
accuracy remained high even over a 32-second interval. The proportion of items 
recalled decreased more rapidly over time as the number of consonants increased. 
But which set of data might then represent the time course of short-term memory? 


If recall of a single consonant is nearly perfect, then it might simply be assumed 
that as other consonants are added eac. 


h consonant interferes to some extent with 
recall of the others. 


The two extreme possibilities in these Procedures are perfect recall and no 


8 task only partially prevents rehearsal, 
ause the learner does have some oppor- 

in that task. Because more rehearsal is 
Presumably necessary for recall of more co 


Melton (1963) 
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Probability of three consonants was greater in the Melton experiment (Figure 
13-3) than in the Peterson and Peterson experiment (Figure 13-2): Melton’s 
learners recited the consonants aloud and were therefore guaranteed at least one 
rehearsal before they began the backwards counting, whereas the Petersons’ learners 
did not do so. One case used visual stimuli and the other used vocal stimuli, but 
the opportunity for immediate responding to the consonants was probably more 
important than the modality difference. 

Once we recognize that each item may affect the recall of other items, we 
may ask how the numbers in the backwards-counting task come to affect recall. Do 
they prevent rehearsal, or do they interfere with recall in some other way? If each 
number produced by the learner during backwards counting is regarded as an item, 
does recall vary with time or with number of items? This problem was addressed 
by surveying memory experiments in which delays between item presentations and 
recall included various numbers or rates of intervening items (Waugh and Norman, 
1965). Data from many studies indicated that the number of intervening items 
during the delay was more important than its duration. For example, learners were 
Eiven a sequence of numbers at one per second or four per second, and then were 
asked to name the number at a particular location in the sequence. Recall 
Probability after four intervening items at four per second was about equal to that 
after four intervening items at one per second, even though the delay between the 
item and the opportunity for recall was about one second in the former case and 
four seconds in the latter. On the other hand, recall probability was greater after a 
delay with two intervening items at one per second than after a delay with eight 
Intervening items at four per second, even though both delays lasted about two 
Seconds. If any rehearsal occurs while items are presented, items presented slowly 
Should permit more rehearsal than items presented rapidly. Thus, how can the 
Preceding outcome be reconciled with an account of short-term memory in terms 


Of the effects of restricting rehearsal? | i 
Still another problem is that recall probability changes over Tle st SEES 

‘8 Short-term memory procedure (Keppel and Underwood, 1962; Kincaid and 

ickens, 1970). For example, the recall probability of a three-consonant item on 


the first trial of a short-term memory session is nearly 1.0 at a delay of either 3 or 

Seconds; over the next three to six trials, the probability of recall decreases, but 
the decrease is much greater when the delay is 18 seconds than when it is 3 seconds. 
un Other words, the relation between recall probability and the delay imposed by an 
Interpolated task, as in Figures 13-2 and 13-3, is absent at the beginning of sessions 
And builds up over several trials. This is an example of proactive inhibition (cf. 
chapter 12): the learning of the consonants on the first trial interferes with the 
Tecall of other TA on later trials. We will have more to say about such 
Proactive effects later. হা | j 
A change in stimulus items can temporarily eliminate the proactive effect in 


Short term memory experiments. For example, recall probability increases substan- 
Hally ial or ations in stimulus items as changing the items 


s or from numbers to letters. Similar 
eff B items (€.g-, with a change in semantic 


of a 
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categories, as from furniture items to food items: Wickens, 1970). These procedures 
may be more relevant to defining verbal classes (cf. chapter 11) than to analyzing 
short-term memory. 

Short-term memory procedures are usually designed to prevent rehearsal 
rather than to encourage it. It is therefore curious that rehearsal has played such a 
substantial role in interpreting the results of such procedures. We cannot specify 
a single short-term memory function, nor can we say confidently what the role of 
rehearsal is. Furthermore, demonstrating the effects of interference among items, 
both within and across trials, makes it inappropriate to speak of short-term memory 
as the passive decay of items. Nevertheless, the behavior of recalling an item after 
its uninterrupted rehearsal is different from recalling the item if there has been no 
opportunity for such rehearsal. This difference, together with the relatively limited 


span of immediate memory, justifies the distinction between short-term memory 
and other types of memory. 


Long-term memory: 
Interference and the nature of 
forgetting 


Short-term memory is studied Over periods of seconds or at most minutes, 
Whereas long- i 


d over relatively longer time 
“term memory. 

©ngterm memory, by Ebbinghaus (1885), are 
the method of savings (cf. chapter টা “or sense-syllab]je lists was assessed by 


1.0 Ebbinghaus (1885) 
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Figure 13.4. Ebbinghaus’s forgetting curve. Ebbinghaus learned and then relearned 
a2 Syllable lists. The time between original learning and relearning varied, and the 
Root upon relearning was taken as a measure of remembering (cf. chap. 12). 
Orgetting was substantial shortly after original learning (the first data point is at 
minutes), but even after 31 days the savings were Still greater than 20%. 


(Adapted from Ebbinghaus, 1885, chap. VIL.) 


on time periods or that they remained intact but were gradually overlaid by or 
idden beneath other memories. (In a more contemporay vocabulary, the distinc- 


tion is expressed as one between decreasing availability of items with time, as items 


Are lost from memory, and decreasing accessibility with unchanged availability, as 


‘tems remain in memory but become harder and harder to retrieve.) | 
These alternative views were the source of various theories of forgetting. 
Theories based upon the passive decay of memories, sometimes called trace 


theories, often assumed correspondences between remembering and hypothetical 
d upon competition among different 


poe in the nervous system. Theories base [ 2 

Mories, sometimes called interference theories, were more likely to rely upon 
environmental and behavioral variables. Nevertheless, the notion of memories as 
traces that faded or decayed over time was implicit in many early generalizations 
€.8., Jost’s Law: if two associations are of equal strength but different ages, the 
Older One will lose strength or be forgotten more slowly than the newer one: Jost, 


Ee? The problem with such generalizations is that what is remembered may vary 
With the criteri i he way in which remembering is measured. 


relearning a list may differ from 


by common learning criteria, the lists 
fe Orgotten at approximately equal rates, even if they differ in difficulty or the 
‘Me to meet the Jearning criterion (Underwood, 1964). সা 
y The effects of sleep on memory provided one way to distinguis between 
“ccounts of memory in terms of decaying traces as opposed to interference. There 
OSE Convincing evidence for learning during sleep (€8., Emmons and Simon, 

» but less forgetting of what is learned while awake occurs during sleep than 
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i TE- 

during an equal waking period (J. G. Jenkins and Dallenbach, Ey পতা 2 
NV i i i han events duri 

i aking were more likely t a 

tation was that events during w I LCE 

i d. Other accounts, owever, § ন 

interfere with what had been learne k a 

THES Such as memory consolidation, which argued that memory ss 

impermanent immediately after learning, and that a period of ET 1961). 

after learning for memory to become fixed or consolidated (e.g., Glic TR af 
According to this theory, various events, could disrupt the conso 


দি idgtion, 
memory, whereas sleep had no effect on or might even facilitate EE 
Consolidation theories were influenced by the phenomenon of remini. 
(Ballard, 1913; McGeoch, 19 


42; see also Kamin, 1957). Reminiscence, most El 
to be observed with incompletely learned materials, is an increase in el 
probability as time Passes since the end of learning, later followed by TE 
decrease in recall called forgetting. Reminiscence has been observed over min 


ects in human verbal learning (but see Tulving, a 
interference, in one form or another, have tended 
dominate the analysis of long. 
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Figure 13-5. Re 
previously learne 
Forgetting Was 
forgetting depe! 
(Adapted from 


call of the ite 


Ms on a list as a function of the number of lists 
d. Each Circle Tepresents data froma different experimental study. 
Breater as the number of Previous lists increased; in other WOICS; 
ded on proactive interference from the learning of the earlier lists. 
Underwood, 1957; Figure BY 
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Words, the important events were assumed to act retroactively: events that 
Preceded recall affected what had been learned at an earlier time. The critical 
finding for analyses in terms of interference was that earlier learning had substan- 
tial proactive effects on what was more recently learned, influencing the forgetting 
of more recently learned material (Underwood, 1957; Underwood and Postman, 
1960). Figure 13-5 summarizes data from a variety of experiments, and shows how 
the forgetting of a list varies as a function of the number of lists learned earlier 
(Underwood, 1957). 

The observation of proactive interference was significant because much data 
On human memory over more than half a century had been obtained from practiced 
learners who had served in experiments involving the learning of many lists. Ebbing- 
haus himself typified that circumstance, and even learners who did not participate 
NM experiments with many different lists under many different conditions were 
Ordinarily given practice lists before the experiment proper began. From those 
Studies, the estimate of forgetting over 24 hours had been about 75%; Underwood's 
analysis showed that most of this forgetting is attributable to the proactive inter- 
ference from the learning of earlier lists, and that without such interference 
forgetting is only about 25% (zero previous lists in Figure 13-5). In other words, a 
Predominant cause of forgetting is that older learning interferes with remembering 


What is more recently learned. 


This analysis showed that the remembering of a verbal list learned in the 


f other verbal lists in the laboratory. 


ere w 


Ould be little forgetting or no forgetting at all if it were not for proactive 
terfey 


ence (in fact, retroactive effects also contribute to ss Et 
ne interpretation of proactive interference assume EA 
e way unlearned during second-list learning, 2 Ue second-list 
ite LCOUS recovery, the first-list items gradually return oF 8 I 

cms (cf. Barnes and Underwood, 1959). The PIAUSIBIY OL SUES OCCT FOS 
teduceq by comparisons of recall and recognition of first-list and second-list items 
A various HE Ae learning, and by findings that proactive interference could 
YAY with relations among diana) items. Proactive interference is a phenomenon, 


in the details of its operation remain to be worked out. In AE 
terference does not explain forgetting; it is a name occasioned by es 


Mt affect forgetting (cf. Crowder, 1976, PP- 194-195). 


in Som. 
Sponta 


Overview: Metamemory as 
differentiated and discriminated 
remembering 


We have identified three memory classes: the persisting stimulus effects called 
iconic memory; the momentary recall, extendable through rehearsal but limited in 
capacity, called short-term memory; and the durable remembering called long-term 
memory. Our iconic memory of a visual item will depend on whether we have 
looked at it, and our short-term memory of the item will be influenced by how bed 
have rehearsed it, and our long-term memory of the item will vary with the ways in 


which we have responded to the item. We need not quibble over whether 
remembering is behavior. What is more im 
tion of iconic memory, 


of rehearsal that exten and mnemonic techniques that 


n to judge the properties of our 
vell, Friedrichs, and Hoyt, 1970). 
Ory are differentiated classes of 


most appropriately, running memory of 
races run, laps completed itions of competitors, and other information 


One experiment (Waugh, 
recall probability of common o. 
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1972) was based u 


Pon the effects of list length on 
ne-syllable words. I 


n free recall, the probability that 
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any item will be recalled decreases with list length. Thus, recall probability is higher 
for the items in a 20-item list than when the same items are the last 20 items of a 
40-tem list. The experimental condition included 20-item lists and 40-item lists. 
In some 40-item lists, an instruction to forget the first 20 items was presented after 
the twentieth item; in others, no instruction to forget was given. With instructions 
to forget the first 20 items, but not without such instructions, the recall probability 
of the last 20 items of a 40-item list was no less than the recall probability of the 
20 items of a 20-tem list. In another condition, some items included among the 
first 20 of the 40-item list were repeated among the last 20 items of that list. Even 
with the instruction to forget the first 20 items, the recall probability for these 
items was greater than that for new items. Whatever the learner did when instructed 
to forget did not cancel the effects of the items to be forgotten; the learner was 
more likely to recall those items when they appeared again in the second half of the 


list than to recall items that had not appeared before. 
Not only can learners learn to remember and to forget differentially, as when 


they are more likely to remember unfinished than finished tasks (the Zeigarnik 
effect: Zeigarnik, 1927); they can also discriminate among various properties of 
their own remembering. The “tip-of-the-tongue”™ phenomenon (R. Brown and 
McNeill, 1966: cf. chapter 10) is an example of a discrimination based upon the 
likelihood of remembering. We can sometimes say that we will be able to recognize 
4 Word even though we are unable to recall it at the moment. Both at the presenta- 
tion of an item and at its recall, learners can estimate the likelihood of remembering 
Ce.g., Groninger, 1976; Hart, 1967). We can also sometimes discriminate between 
never having learned something and having learned and then forgotten it (Kolers 
and Palef, 1976). To say that we are confident or uncertain about something we 
have recalled is tacting still another feature of our Own remembering. Such discrim- 
inations are not themselves memory, but they may well be important components 
Of remembering. For example, the student who cannot distinguish between what 
has been learned well and what has been learne 


d inadequately in a course is not 
likely to be able to study effectively. Thus, the topic of metamemory is important 
in which t 


because it deals with the circumstances he learner will engage in behavior 
relevant to remembering. This behavior, ranging from simple rehearsal to mnemonic 
techniques, is a critical component of both short-term and long-term memory. 


aE METAPHOR OF STORAGE 

D RETRIEVAL 

An episode ing is defined b omponents: the initial learning of 

an item, sles G3 and then and pportunity for the A the item. 

1 the research literature on memory, 2 metaphorical treatment an ese three 

components has gradually evolved into a technical langue i ] I 

Bower, 1073: . Shiffrin and Atkinson, 1969). The initia leaning is 
» 1973; Norman, 1970; w the item is retained 


i i ines ho 
Said to result in the storage of the item, which ee ortunity for recall; th 
OVer time; this period of retention is followed by the OPP ; the 


y three © 
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recall of the item is then called retrieval from storage. An item that has been stored 
is sai ilable, but it is said to be accessible only if it can be retrieved. 
ট ETL are analogous to storing index cards in and then retrieving them 
from a file, or storing information in and then retrieving it from the memory a 
of a computer. The important difference is that we cannot examine any Pp চা 
where remembered items are retained in the way we can examine the contents ণ 
a file drawer or the fine structure of a computer core. Even What is remembere 
remains unspecified. We can examine an index card or the electrical charges 
computer, but we cannot find a remembered item within the learner’s head. 
Although the language of storage and retrieval is an effective analogy for what 
happens in remembering, we must recognize its metaphorical status. In fact, one 
objective of some research on memory is to explore the limits of the metaphor, or 
the range of conditions over which it holds. An analysis of these limits may 
determine the circumstances in which we may appropriately tact the functional 
Properties of remembering in the language of Storage and retrieval. 
By definition, a remembered item is one that has been stored and retained 
and retrieved. The failure to remember an item may occur because the item was not 


stored in the first place, or because the item was stored and then lost from storage 
during retention, or because the item was not retrievable at the opportunity for 
recall. In this section, we examine h 


Ow remembering can be affected by events 
during these three phases. 


Storage: Encoding and levels 
of processing 


What the learner remembers is not so much the stimulus itself as the learner's 
response to it. Th i i 


A more contemporary version of this point is the following: 


UE OMOle WE to coat the occipital lobes of the brain with a special 
নি Surreal VSL St which, when developed, yielded a reasonable copy © 
Stimulus. In many quarters this would be regarded as 2 
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triumph in the physiology of vision. Yet nothing could be more disastrous, 
for we should have to start all over again and ask how the organism sees a 
picture in its occipital cortex. (Skinner, 1963; see also Skinner, 1976, p. 74) 


As in the analysis of stimulus control, the problem of remembering is not to be 
Solved by trying to follow the stimulus into the organism; rather, we must discover 
how to characterize the ways in which the organism behaves with respect to the 
Stimulus. 

The learner’s behavior with respect to the stimulus to be remembered has 
been called encoding (e.g., Melton and Martin, 1972). Consider the following 
experiment (R. Conrad, 1964). In one part, the learner Was to name spoken letters 
Presented in background noise. In the second part, the learner was to write down in 
Order a sequence of six letters that had been presented visually. In the first task, 
errors typically occurred along dimensions of common acoustic properties; for 
ee: V was more likely to be confused with rhyming letters such as B or C 
ৰ With other letters such as N or X. The same types of errors occurred in the 

cond task; errors were again more likely along dimensions of common acoustic 
Froperties than of common visual properties (e.8., for straight lines versus curva- 
Te, written V has more in common than N than with C: cf. Gibson, 1965). Even 
a the second task used written stim সা written responses, the learner is 
Id to have encoded the stimuli acoustically and not visually. We need not assume 
At the learner's responses were subvocal speech; that remains a reasonable 
Possibility, but more important is that the learner's remembering was based upon 


Aco i A 
Ustic rather than visual dimensions of the stimulus letters. OO 
he dimensions along which stimuli are 


The St 
ructure of memory depends on t J 
ৰ favor encoding based on semantic 


enco 
pro ded. For example, some tasks may | 0 - 
Perties (defining technical terms), others may favor encoding based on forma 


aS such as modality (learning spellings or pronunciations), and ৰ eo" 
com ie encoding based on temporal properties (following the শত hee 
ur Position). The context in which a given item appears, the orde. ৰ Sp J s 
jos i Of items, the formal or semantic relations among an Ee ead 
Way Re to the learner all are properties that the te ৰ sy 

i | urthermore, encoding may vary from time to time 50 h bem 
ime; and may be based on combinations of properties কি on Hoge 
bro sions. Tt should not be surprising that We can identify di yp 
Perties of encoding in different types of learning tasks. 


We will consider a few 
“Xam, i ing called substitution 
and Ples, in the context of the two major classes of encoding 

laboration. 


Sim ht . 
Be tay sae i i op 20 Byorse-code learning: cf. F. 5. Keller, 
f the binary digits, 0 and 1 (G. A. Miller, 
101001 substantially exceeds the span of 
mory. Each group of three binary digits, however, can be replaced 
© Octal digit, according to the following list: 


forward encoding, and corresponds 


ediate me 
Ya singl 
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000-0 100-4 
001-1 101-5 
010-2 110-6 
011-3 111-7 


The binary sequence then can be coded as the octal sequence 215351, ae 
ithin the span of immediate memory. Encoding groups of items so লিন 
EE of items to be remembered is reduced has been referred to as chun ! 
The acoustic encoding of written letters can also be regarded as substitution, i 
that there is a unique correspondence between spoken and written letters. 

A second major type of encoding is called elaborative encoding, as when 
CVC nonsense syllable or a sequence of consonants is transformed into a word a 
a phrase. For example, a learner might rehearse the nonsense syllable BOH 2 
“BOTH without a T,” or the consonant sequence QBF as “Quick Brown Fox. 


Such verbal instances of elaborative encoding have been called natural-language 
mediation (Prytulak, 1971). Another Variety of elaborati' 


imagery, discussed in the context of mnemonics (cf. Paivio, 1 
Madigan, 1968). Elaborative encoding does not have the u 
between stimulus items and the learner’s responses that chal 


and thus an encoded item may not always uniquely occasion the appropriate 
response. For example, the learner who encodes the word 


“baggage” according to 
the method of loci by imagining a tower of suitcases standing in front of the 
fireplace might well Say “luggage” at recall. 

Encoding, whether b 


ve encoding is visual 
969, 1971; Paivio and 
nique correspondence 
racterizes substitution, 


not the same in the context of ‘¢ 


jam” (cf. chapter 11). This encoding may affect retrie 
list in the context of “log jam,” i 


in later lists it will be 
in some contexts, such as “traffic jam,’ 


“jam session.” 


val. If “jam” appears in ae 
more likely to be recognize i 
’ than in others, such as “raspberry jam” 0 


মী টু jons 
8 Objects at successive location 
We should not expect an exhaustive listing 
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Buarantee variability in the ways that we each encode stimuli (cf. Anderson and 
Bower, 1973). 

Different types of encoding differ not only in their likelihoods of occurrence 
in different circumstances, but also in the likelihood that what is encoded will be 
temembered. Some kinds of encoding seem more superficial and therefore less 
Memorable than others; this dimension of encoding has been called level of process- 
ing or depth of processing (Craik and Lockhard, 1972; Glanzer and Koppenaal, 
1977). Fewer words from a list are recalled after a task requiring responses to 
Structural properties of words (e.g., crossing out vowels, copying the words, 
counting the number of letters) than after a task requiring semantic responses on 
Assigning Words to categories, discriminating among plant and animal names). The 


‘mplicgtiop js that the greater the depth at which or to which an item is processed, 


the More likely it is to be remembered. We are certainly more likely to remember 
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rather than raises the probability that the learner will recall the remaining items 
(Slamecka, 1969). It is clear that, at least in some lists, relations among items have 
less effect on recall than other properties of the items. 

Retrieval sometimes has been discussed in terms of the metaphor of search 
(Yntema and Trask, 1963). The learner is said to search through the memory store 
for items having particular characteristics until the appropriate item is found. The 
store might be analogous to a stack of index cards in a cabinet or to the memory 
banks of a computer. (The metaphors for human thinking have typically kept up 
with contemporary technologies; thus, in earlier times these metaphors were clock- 
Word systems or hydraulic mechanisms: cf. McReynolds, 1971.) But the metaphor 
has problems in dealing with how rapidly and accurately humans can discriminate 
between knowing something and not knowing it (Kolers and Palef, 1976; see also 
Wason, 1965). Why should search ever take longer when an appropriate item exists 
to be found than when such an item does not exist (and therefore when the search 
must examine every item in storage)? Aside from the problem of how rapidly we 
Can say that we do not know something, the metaphor also seems implausible 
because of the vast capacity of human memory; if nothing else, a search through 
Memory storage at every instance of remembering seems inefficient. (Yet the 
increasing size of the memory store with experience, and thus the greater difficulty 
of finding any particular item, could be used as an argument for proactive inter- 
ference; proactive interference is presumed to act more at retrieval than at storage.) 
Cue.dependency, however, suggests the possibility of a more restricted search, the 
limits of which are defined by the conditions at recall. The metaphor of search, 
though not likely to be adequate in general for accounts of human memory, may 


be relevant to some special classes of memory tasks. f 2 

In general, the implication of cue-dependency is that the more the conditions 
Of recall are like the conditions of learning, the greater will be the likelihood of 
recall. These conditions include not only the to-be-remembered stimuli and the 
learner’s behavior with respect to the stimuli, but also the situation within which 
the learning takes place. The special case of cue-dependency in which the relation 
between the learning and recall situations affects recall has been called state- 
dependent learning. The conditions studied include drug states and physiological 
States (e.g., electroconvulsive shock) as well as experimental settings C.s., Overton, 
1964; A. M. Schneider, 1967). For example, what was learned while intoxicated 
may be more likely to be remembered if the learner is again intoxicated than if the 
learner is sober. L 

Cue-dependencies and state-dependencies may have important practical 
implications. But we may not wish to advise the student Who is preparing for an 
examination to study in the room in which the examination will be held. For one 
thing, what has been learned will be useless after the course is over if the student 
cannot remember it anywhere else. More important is that what is learned will be 
best remembered if many of the conditions for remembering arise out of the 
Subject matter itself. A systematic subject matter is one in which each component 
prompts and is prompted by other components. To outline the subject matter, 
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therefore, is to create an encoding that will determine how well the various parts 
are remembered. 


THE STRUCTURE OF MEMORY 


We have considered various properties of memory. Many kinds of events and 
relations can be remembered. For example, the Pigeon that pecks a key today 
because pecks yesterday Occasionally produced food can be said to be remembering 
the past contingency between pecks and food. We have already questioned whether 
remembering of this sort is related in any simple way to remembering in human 
verbal learning. The vocabulary of memory may be occasioned by almost any situa- 
tion in which current behavior is influenced by past events. We remember particular 
incidents, word definitions, contingencies, stimuli, language syntax, and our own 
behavior. Given this variety, we might expect properties of memory to vary with 
what is remembered. An example is motor memory (e.g., Baddeley, 1976, chapter 
10). Two classes of motor memory have been distinguished: discrete skills, such as 
typing or changing gear in a manual-shift car; and continuous skills, such as tracking 


olhs ion briefly surveys two classes of memory 
distinguished by what is remembered. 


Episodic and semantic memory 
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Temembered rather than with the functional properties of such remembering (cf. 
Anderson and Bower, 1973; Shimp, 1976). Some relevant research was presented 
'n chapter 11, in the context of psycholinguistics. For example, a learner is more 
likely to remember semantic than syntactic properties of a sentence in a text 
(Sachs, 1967). Remembering a particular sentence that appeared in a particular text 
i episodic memory. But semantic structure is defined by how we generalize across 
Syntactic transformations of sentences; one way we determine the nature of 
semantic ang syntactic classes is by examining what is remembered. The corres- 
Pondence between what is presented and what is remembered (recalled or 
Tecognized) gefines structural classes in memory in the same way that the 
“Orrespondence between the behavior that is reinforced and the behavior that is 
Benerated defines the structure of operant classes (cf. chapter 6). In other words, 
“sing Iemembering to study the structure of verbal behavior might be more profit- 
ble than using verbal behavior to study the nature of remembering. k 
A Variety of experiments have examined the structure of what is remembered. 
“Analyses of how stories and texts are remembered have led to accounts of the 
‘etarchica] structure of components of the story plot or text (e.8., Bower, 1976; 
“oon, 1977), Analyses of the remembering of words and texts in different 
guages have shown that the structure of the semantic classes of bilingual speakers 
extends Across language boundaries, as when the learner’s remembering of neering 
N independent of the language in which items are presented (e.2., Kolers, 1966; 
Od, 1976). Analyses of the implications that learners can derive from Ee 
aE and logically related sentences have shown how items of ER 
penized into an integrated structure (e.g., Moeser, 1976; Moeser and Ta ন 
* Such ies inevi bine episodic and semantic components of 
me Studies inevitably combine eplis th specific dated items and 
oy because they include the remembering of both speciti F PS 
ener] Structural relations, but they show also how an analysis of remem g 
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FOURTEEN 


EL root gno- is the source through Old English of know and 
able tS Er and through Germanic of cunning and can, in the sense of being 
arid this Its Latin derivative is gnoscere, to know or to get acquainted with, 
knowled is a root of cognition, ignorant, recognize and notice. Synonyms of 
Buropean are often related to sensory language. For example, the Indo- 
Leen, tho to see, leads to view and vision through the Latin videre, to 
English ugh the Greek eidos, and to guide and wisdom through various Old 

and Germanic forms. The word see itself implies understanding, as in 


the 
Phrase [ see, and has such relatives as the word insight. 


r treatment of the effects of stimuli and 
uctural and functional problems in the 
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definition of the structural units, then make those units in turn the basis for a more 
rigorous description of function, and so on. 

We indicated, in our initial discussion of function and structure, that concern 
with functional problems tends to be correlated with a behavioral vocabulary, 
whereas concern with structural problems tends to be correlated with a cognitive 
vocabulary. In this text, we began with an emphasis on the experiments and 
vocabulary that had grown out of the behavioral tradition. The behavioral 
treatment was extended to verbal behavior, which began the transition to POPES 
that are the primary concern of contemporary cognitive psychology. Those topics 
were examined in the context of Psycholinguistics, verbal learning, and memory. 
This chapter further illustrates research methods and issues in cognitive psychology; 
it then proceeds to a brief treatment of problem-solving, in which structural 


analyses converge with the functional Properties of stimulus control and contingen- 
cies. 


COGNITIVE PROCESSES 


We already have spoken of some classes of behavior called cognitive processes (e.8., 


imagining or Visualizing). Cognitive Processes are private events: we cannot see what 
Someone else is thinking or imagining. But imagining, like walking or talking, is 
something we do. Some might argue that the different dimensions of these private 
events do not justify calling them behavior. Yet our treatment of behavior has not 
been limited to particular Movements; operants, for example, are not defined by 


response topography. Certainly it is difficult to say j 


ই I allucinations; cf. Skinner, 1953; Paivio, 1975). The diffi- 
culty is that this behavio ible to anyone but the one who 
he recommendation to convert Psychological nouns 
Instead of “cognition and thought,” we should say 
SO Malcolm, 1971). 
nt called paying attention or attending to something. 


ttention as the control over responding by some 
not others. That treat 
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demonstrated in the history of psychology. Introspection was not reliable. 
Different individuals, and the same individual at different times, reported events in 
different ways. Attending presented particular difficulties: how does one attend to 
One’s attending? The difficulties were inevitable, because the language of private 
events must be based at some point upon what is publicly accessible to the verbal 
community (cf. chapter 10). A private event is available only to the person behav- 
ing. This does not mean that private events are outside the boundaries of a 
behavioral analysis; it does imply, however, that We learn the language of private 
events on the basis of consistencies in the relations between Private events and 
Public events. Imagining an object, for example, must have something in common 
with seeing the object. REE % imuli 

In any case, the response of attending is not a movement. For visual stimuli, 
it may superficially seem analogous to looking toward or even Pointing at some- 
Shing, But iF attending, were Sintply equlve ent CT ONS ITS PE ations Th 
Which we look without seeing would be excluded. We do EES with out 
Seeing, however, as when we are daydreaming or lost » HE 
then say that we were not paying attention. The behavior a SE A ন 
also share something with what we do when we say we are NE aR hn 
for something. Such behavior has been called A 2 annie Le 
Bruner, 1957). For example, what we notice or EEG RTE EE 
to find a name will not be the same as if we are scanning to ি NEASINab]e by OTE: 

We have already seen that when a response I the behavior 0 as its 
ment, we may be able to measure other propels extended to the response 
duration or its latency (cf. chapter 11). This OE ER 
of attending by Sperling and Reeves 0 a int, letters were presented in 
fixation point in the visual field. To the left of this poin TEE Ls Sais 
Tapid sequence one at a time. To the right, a sequence 
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indirect, but varied experimental techniques for the study of imagery have been 
explored (Paivio, 1971). Here again, temporal measures have been particularly 
effective. For example, observers were shown pictures of pairs of three-dimensional 
figures, with instructions to report whether or not the figures were the same 
(Shepard and Metzler, 1971). When the figures were the same, one was rotated 
relative to the other, as illustrated in Figure 14-1. For such figures, the latency of 
the report that they were the same was linearly related to the difference in orienta- 
tion, in degrees of rotation. In other words, mental rotation, or rotation of an 
image, has some of the same properties as the rotation of an actual object: in both 
cases, the time taken to perform the rotation is proportional to the distance 
through which the object is to be rotated. But by treating the image as a thing we 
must not be mislead into using it to explain behavior; the imagining of the object, 
like the rotation itself, is something the observer does. The temporal properties 0 
rotating real and imagined Objects are similar because of the correspondence 
between what the observer does when seeing an object and what the observer does 
when imagining it. Imagining is visual behavior in the absence of the visual stimulus. 
It has been Suggested that Watson, the founder of behaviorism, may have 
denied the existence of images because he himself was incapable of visual imagery 
(Skinner, 1959). The recognition that such a denial was irrelevant to a behavioral 


analysis was slow in coming. Whether the image has some physical status inside the 
Observer's head is beside the point. The adv: 


ন antage of saying that we imagine an 
Object over saying that we have an image of the object is simply in its emphasis on 
A. 


1 » the tw, 
rotation. (From Shepard and Metzler, 19 
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What we do. What is at issue is not the status of images, but rather the role of 
such private events (or cognitive processes) in accounts of behavior. Our primary 
concern must be to determine their own properties rather than to base accounts 
of other kinds of behavior upon them. A behavioral account need not deny the 
existence of such events, but it limits the conditions under which such events can 
Properly serve as explanations of behavior (cf. Paivio, 1975, Pp. 287). 
K The example of mental rotation used visual stimuli. The response of visualiz- 
ing also occurs in the absence of such stimuli, as we saw in the case of visual 
encoding as a mnemonic technique. In such cases, there is even more temptation 
co speak of the image as a thing seen rather than of imagining as behavior. In the 
Photographic” memory of someone with cidetic imagery, for example, reports of 
the details of past scenes are accompanied by eye movements similar to those when 
Scanning a current visual scene (e.g., Haber, 1969). Yet to say that the reports or 
the eye movements are caused by the remembered image does not explain anything; 
an account of imagining must be derived from the behavior of seeing and not from 
the stimulus properties of a private event. 
Consider some additional research on Vi 
Moyer, 1973), observers were given pairs of wo 
and were instructed to choose the larger. The great 
the two named classes, the shorter were the reac 
truck.cat was faster than that to book-chair). The 


Systematically in size, and the relation between siz Tene 
Was similar to that with pictures rather than words as stimuli. These and related 


experiments (e.g., Paivio, 1975) suggest that this task involves imagery occasioned 
by Words; such behavior is presumably analogous to the visual encoding discussed 
‘1 chapter 13. 


isual imagery. In one experiment 
rds (e.g., mouse and elephant), 
er the difference in size between 
tion times (e.g., the response to 
words themselves did not differ 


«ww, In another type of experiment, learners are given pairs of statements such as 
is taller than B” and “A is shorter than C,” or “X is to the left of Y” and 
is to the right of Y,” and then are instructed to indicate whether such 

Statements as “B is taller than C” or “Y is to the left of Z” are true or false (e.g., 


Brooks, 1968: Huttenlocher, 1968; Scholz and Potts, 1974; see also Huttenlocher 
And Strauss, 1968; Moeser and Tarrant, 1977). Latency measures and the differen- 
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increasing the time between the Presentation of the scene and the observers 
description. Systematic studies of such Phenomena, however, are not available in 
the research literature. 


Processing stages 


As we have seen, temporal measures have been an important feature of the 
analysis of cognitive processes. Besides showing that Private events take time, some 
Cognitive analyses have been concerned with demonstrating how tasks can be 
decomposed into their Separate components or Stages. 


ing questions about mental 0 erati 
1969, p. 421) 


oe রশ DE ‘ed to identify particular stages, such as a discrim- 
foinithe eur an tracting the Simple reaction time to a single visual stimulus 
Soh gt ate reaction time to one of two visual stimuli). At various times, 

ECS as stimulus registration, decision, and response initiation were 
problems, however, complicated stage 
Stage durations determined in different laboratories and even 


he same laboratory were highly variable, presumably because 
erformed in a varie 
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Presented visually, the observer’s instructions were to press one button if the digit 
Was a member of the positive set and another if it was not. The latency from digit 
Presentation to button press was measured over successive conditions in which the 
Number of digits in the positive set was varied. 

How might this task be performed? To speak of search is in this case a meta- 
Phor, but what properties should such a metaphorical search have? The observer's 
Tesponse is determined by whether the presented digit matches one of the digits in 
the positive set. One stage of this task, therefore, might consist of comparisons 
between the presented and the positive-set digits. Let us begin by assuming that the 


yp nParisons are made one at a time instead of all at once, or, in other words, that 
he observer engages in serial search instead of parallel search. Serial searches may 
search the positive-set 


a in at least two different ways. First, the observer might siti 

Bits either in a varying and random order or in a fixed order (e.g., beginning to 
end); we will speak of random-order and fixed-order search. Second, the observer 
May discontinue comparisons as soon as a match is found, or may complete all 
comparisons before attending to matches; the former is called self-terminating 
Search ang the Jatter exhaustive search. The various types of search have different 


u je. . Fl খ্থ e 
antitative implications, as illustrated in Figure 14-2. 
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Consider first the relation between reaction time and list length. In EL 
search the presented digit, whether in the positive set or not, is compared wit i 
the positive-set digits. For example, if each comparison takes 50 milliseconds, e id 
item added to the positive-set digits should add this duration to both positive Fs 
negative reaction times (negative reaction times are those to digits not in 
positive set); the slopes of these two functions in this case will be equal (the io 
functions will thus be parallel, but they might differ by a constant if positive 
responses take more or less time than negative responses). With self-terminating 
search, however, all comparisons will be made for negative digits, because no 
matches will be found, whereas for positive digits a match will be found, on the 
average, after only half of the comparisons. In this case each digit added to the 
positive set will add 50 milliseconds to negative reaction times but only 25 milli- 


seconds on the average to positive reaction times; the slope for positive items will 
therefore be half that for negative items. 


gh specific, and even 0 
addeley, 1976; Crowder, 1976). More complex variations [) 
2 example, produced reaction-time functions characteristic 


A g ating 
sin addition, the functions rel 


Sternberg (1966) 
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n times for positive and negative items as a function of 
ositive set, for eight observers. Errors (usually within 1 
f Juded from the data. Reaction time (RT) as a 
unction of number of items (N) is roughly described by the equation: RT = 400 
N, where RT is in milliseconds. (From Sternberg, 1966, Figure 1.) 
in the positive list were consistent with 
and fixed-order search for others (Stern- 
hor of search is strengthened by the 


Bigure 14-3. Mean reactio 
or 09 BET of items in the p 
% of total responses) were exc 


oo times to position of the item 

L Ndom-order search for some observers 

CIg, 1969; cf. Figure 14-2). The metap 

Quantitative detail of the data, but its range of application is limited (e.g., different 

effects may be obtained if the number of items in the positive set exceeds the span 

of immediate memory; for this reason, the metaphor has not been extended to 
Y the metaphor is based on the 


ory in general: cf. chapter 13). PUREE 
SsUmption of serial search rather than parallel search, in which items are compared 


Smultancoy sy instead of successively. Parallel search need not be restricted to 
fixed amount of time regardless of the number of 


cases in which the search takes a 

isht become slower as the number of items 
n account in terms of parallel search that 
r serial search (e.g., Townsend, 1971). 
e advantage of this type of approach is in its 
jls of performance. An adequate description 
tages will necessarily depend on the com- 
solation (as in the example of 
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attending with which this section began) and those concerned with the dynamics of 
more complex tasks (as in this instance). 


Mental representations 


In their emphasis on the temporal properties of attending and visualizing and 
searching, these treatments primarily have dealt with defining the structure of 
private events. They have been less concerned with functional issues such as the 
conditions that occasion these events or the consequences they may have. But, aS 
we saw earlier, there are other aspects of structure, such as sequential and 
hierarchical organization (e.g., consider the sequential and hierarchical components 
of completing a college major: the major consists of individual courses the order of 
Which may be constrained by prerequisites; the courses may be decomposed into 
smaller units consisting of assignments and examinations; these in turn may be 
further reduced to reading particular passages or answering certain questions, again 
perhaps in specified orders; and so on). Treatments of concept learning and 
Strategies in problem-solving sometimes proceed by showing how correspondences 
may develop between the structure of the environment and the structure of 
behavior (e.g., Garner, 1974; Haygood and Bourne, 1965; Restle, 1962). 

The development of cognitive structure has been a central theme in the work 
of Piaget (e.g., Piaget and Inhelder, 1969; see also Fischer, 1975; Ginsburg and 
Opper, 1969). We can provide here only a brief and inevitably oversimplified 
discussion. The several phases of child development in Piaget’s system (sensory- 
motor, preoperational, concrete Operational, and formal operational) correspond to 


a progression from relatively simple relations between motor responses and their 
Consequences (as in reaching for and manipulating objects) to complex selakior 
that depend both on corre 


on co Spondences between verbal and nonverbal behavior and 
correspondences between environmental structure and behavioral structure. 


Piaget speaks of the development of such correspondences in terms © 
the child must accommodate to the constraints 
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The phenomenon called conservation illustrates some aspects of cognitive 
Structure dealt with in terms of representations. If the water in a short wide beaker 
is poured into a tall narrow beaker, a preschool child is likely to report that there 
is more water in the second beaker than there was in the first. This report is based 
On the higher level at which the water stands in the tall narrow beaker than in the 
short wide one. In Piaget's account of development, this type of response is 
characteristic of the preoperational phase; one property of this phase is that 
responding is based upon single properties of environmental events rather than 
upon relations among properties. Later, in the concrete operational phase, the child 
Says that the amount of water is the same in one beaker as in the other. Such 


reports are restricted to specific instances, An e 
Operational phase that the child begins to speak abstractly of these relations as the 


Conservation of matter. 

One way to deal with this phenomenon is to attribute the increasing sophisti- 
Cation of the child’s performance to successive refinements in the child’s mental 
tepresentations of events in the world. As the child moves from actually manipulat- 
ing objects to imagining or talking about or thinking about manipulating those 
Objects, the structure of the child’s mental representations are said more and more 
closely to approximate the structure of the contingencies that operate 3 for 
Manipulating objects in the world. But the structure of these mental representations 
Was derived from the correspondence between behavior and environmental 
Contingencies. It would therefore be inappropriate to assume that the behavior is 
explained by these representations. The representations do not exist in the child in 
any but a metaphorical sense; instead, they exist in ou ie discriminative behavior 
AS we Observe the child, and correspond to the consistencies and correspondences 


that we observe in her or his behavior. 
The development of conservation i 

With Properties of behavior that we have alr 

Must occur in parallel, and each is part of 


Classes | impler classes. At first, the child’s 
of d upon Simp টু i tis 
i aia Hert lr Je dimension of objects (in this instance, 


discriminat; on a Sin 

eight); চপ টিপক hr UU Vo or three dimensions (area and Cn) 
come later. At the same time, the child’s vocabulary of tacts becomes more inely 
differentiated (from big-little to bigger-smaller and then to taller-shorter, fuller, and 
So on). Tacts under the control of particular instances become the foundations for 

¢ More general classes of tacts we call abstractions. And so LE 
Or not the quantity of water in one container equals that in anot er is not the same 
as Choosing the contents of one or the other container or filling two containers to 
equal volumes, and none of these is equivalent to 8 statement of the principle of 
Conservation of matter. On the one hand, it is important to recognize how very 
ifferent these classes of behavior are; On the other, it is important to eile 
at consistencies across these different classes are inevitable তা of the 
Ay in which verbal and nonverbal behavior is related to events in the world. 


n children is in any case not incompatible 
eady considered. Several progressions 
a hierarchy in which more complex 


PROBLEM-SOLVING 


We have examined only a few samples of research in cognition. Cognition is, after 
all, not learning; yet many of the issues in cognition are closely related to problems 
in learning. As the last experimental topic in this text, we turn briefly to problem- 
solving. Problem-solving has Sometimes been treated in the context of cognitive 
approaches, with emphasis on the Structure of problems. An analysis of problem- 
Solving also raises functional questions, as when we are concerned with conditions 
that may make the solution of a Problem more or less likely. 

Let us begin with a struct 


Solver’s solution. 


In what way, however, can the Structure of the solution to one particular 
Problem contribute to the solution of oth 
ith teaching problem-solving skills NN 
tics (e.g., Wertheimer 1959). Analyses mus 


transfer from one problem to another. 
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the same way, whereas the second group simply filled A and then poured out 
this an example of negative transfer or EE Co 
but it may be more important that this is Obviously a case in whic | 
behavior is determined by the consequences of past behavior. The litera u F 
ations on such situations (e.g., Birch or 
Rabinowitz, 1951; Duncker, 1945; Saugstad and Raaheim, 1960; Weisberg an 
| Olvers who had been instructed to mount a 
OX of candles and a box of matches and a box 
S, matches, tacks, and boxes (Duncker, 1945). 
the wall so that it provided a flat surface on 
- Those who received the candles and matches and 


ing can be affected by discrimina- 


These experiments show how problem-solv i) 
tive stimuli and What then constitutes problem-solving 


by contingencies. But 
Consider an example from Skinner: 
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by discriminative stimuli and contingencies, we have come nearly full circle. The 
experimenter interested in problem-solving might start simply by watching the 
behavior of the problem-solver. Like Kohler in his study of insight in apes, the 
experimenter will discover what behavior the problem-solver is already capable of. 
That is where our systematic behavioral treatment began, with the simplest 
behavioral operation: observation. We may now leave the experimenter to studies 
Of stimulus presentations and contingencies and stimulus control. 


Conclusion 


FIFTEEN 


The word psychology comes from the Greek psukhein, to breathe, and 
psukhe, breath. Thus, when the psyche was said to leave the body of a 
mortally wounded warrior in the Homeric epics, the word may be understood 
to mean the warrior’s breath and not his soul. A curious parallel exists 
between this word and spirit, from the Latin spirare, another word meaning 
to breathe. There is no evidence that these Greek and Latin words are 
etymologically linked. A similar relation between air and spirit exists in the 
Latin anima, originally a breath of air but later soul or spirit in such words as 
animate and animosity, and in the Greek atmos, vapor or air as it enters into 
the English atmosphere but breath or soul in the related Sanskrit atman. 


We have considered the varied phenomena of learning. We began with experimental 
Operations. The first and simplest was the presentation of stimuli. Stimuli produce 
responses, and the effects of stimuli on the probability of responses can be 
modified by the temporal patterning of stimulus presentations. We also saw that the 
stimulus-presentation operation could have other effects: stimuli can affect the way 
in which responses are ordered in time, in the classes of behavior called interim 
and adjunctive, and the elicitation of responses ey affect their subsequent 
emission, in the phenomenon historically called exercise. 

Just as responses may follow stimuli, stimuli may follow responses. We there- 
fore dealt also with the effects of contingencies Or the consequences of responding, 
in treating the consequential operations of reinforcement bes Punishment. The 
effects of these operations led us to distinctions among posifive reinforcement, 
Positive punishment, negative reinforcement (or escape and avoidance) and negative 
Punishment (or omission procedures). In exploring these procedures, we noted the 
importance of some consequences, less dramatic than A biologically significant 
Stimuli as food and water, that affected an Organism 5 aE Ys bs NE 
Ment. In this context, we examined sensory-motor learning, visual fixation, latent 
learning, and other phenomena not usually classified according to the effects of 


teinforcers or punishers. 
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We distinguished between reinforcement and punishment by hi 
response consequences raised or lowered the probability of responding, bu a fess 
recognized that distinguishing between the positive and negative cases 0. a 
operations was often arbitrary. As in reinforcement by a change in temperature, ৰ 
sometimes could not consistently treat the positive and negative cases in terms (0) 
whether response consequences involved presenting or removing stimuli VE 
presenting heat or removing cold). Instead, we treated these cases in terms 0 
temporal relations between reinforced or punished responses and the other 
responses generated by these consequential operations. We noted that stimuli could 
have multiple effects, and that a stimulus that served as a reinforcer for a response 
might also have eliciting effects on that response or other responses. Thus, the task 
of an experimental analysis is to separate such different stimulus effects and to 
Observe how elicitation and consequences can combine to determine the 
characteristics of a performance. 

We also turned to the signalling effects of stimuli, in the operation of stimulus 
control. This operation can be superimposed either on the consequential operations 
of reinforcement and punishment, when we speak of discrimination learning, or on 
the operation of stimulus presentation, when we speak of respondent conditioning. 
Within these contexts, we explored a variety of behavioral phenomena and 
procedures: attention, stimulus-control gradients, reinforcement schedules, sensory 
preconditioning, autoshaping, and conditioned Suppression, to mention only a few. 

These operations seem to exhaust the Possible behavioral relations that can 
be established among stimuli and responses. Our concerns therefore shifted to a 
different problem: characterizing the dimensions along which stimuli and responses 


Vary. In discussing the processes of differentiation and discrimination, we saw that 
the relevant dimensions included not on! 


Y relatively simple ones such as topography 
Or intensity, 


but also complex ones involving the structure of stimuli and responses 
and the relations among them. We found that the correspondence between the 
classes of responses that had particular consequences and the classes of responses 
generated by those consequences was critical to defining behavioral classes, and we 
distinguished between classes defined by response properties, 
Operants, and classes defined by stimulus Properties, called discrimin: 

In our examinations of concept formation and verbal behavior, we were forced to 
conclude that behavioral relations were primary, and that we could not appeal 
simply to physical measures to define such classes. There are responses, like attend- 


ing, remembering, imagining and thinking, that are not easily observed directly, 
but we decided early that it would be inappropriate to define behavior solely in 
terms of movement. 


simply called 
ated operants. 
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assumed that all learning was based on some one process or some small number of 
processes that act across all organisms and procedures. 

The psychology of learning evolved through the examination of the different 
experimental outcomes of various behavioral procedures. As each procedure was 
considered in its historical turn, it was assigned an importance in proportion to its 
demonstrated effectiveness. At different times, the available experimental findings 
led to theoretical formulations dominated by laws of association or contiguity, 
rules of respondent conditioning, or principles of reinforcement and punishment. 
Sometimes this domination was so substantial that one or another process was 
presumed to be the fundamental and exclusive basis of all learning. Such formula- 
tions were inevitably open to challenge and controversy, because the phenomena of 
learning could not be accounted for exhaustively by any single process. As learning 
theories became more precise, they were necessarily accompanied by refinements 
and qualifications restricting the range of phenomena to which the theories could 
be applied. Recognizing the boundary conditions for learning Was implicit in these 
restrictions. Ee 

Limits on learning pose no problems when they can be easily traced to an 
Organism’s sensory or motor capacities. For example, we are not surprised if certain 
stimuli are more likely to produce responses in some species than in others. We 
know that the visual and auditory systems of pigeons and bats make pigeons 
capable of visual discriminations impossible for bats and make bats a of 
auditory discriminations impossible for pigeons. We gy i sete be 
different capacities for responding. Pigeons and bats fly di লী y bec he 
anatomical structures of their wings are different. Althoug ত ot 
Obvious, they are not trivial. They illustrate eae! we take the differen 

iti ifferent specie . ; fo 
line feet Sra, tie effectiveness of rent a 
reinforcers or punishers varies across species. Once we whl NTS 
of reinforcement, it was no longer re আসল লি Tes consequences can 
Or which punishers might be effective fort § that produce them. Within a single 
only be defined in relation to the response 0 oe effective for others, 


ma! 
Species, a reinforcer effective for ছল PE cdgyerion and acknowledged by the 
As demonstrated by phenomena suc 


j imits on learning in terms of 
t specify the limi ning Y 
concept of preparedness, We canno its in terms of 
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the kinds of relations that can be established between - wa ant 
given sDecies. Organisms may be predisposed to learn differen ৰ 
pecies. Orga 


j 3 an A situations. ৰ 
faa” or” Dn ইপাৰ various experimental operations are effective 
e boundaries wi 


in chansing behavior do not alter the properties that define behavioral processes. 
Anging ehavior 


It doe follow that we shou {la phenomenon bya el 
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SOR rl CG discovery of conditions that limited its applicability. 
e challenge 
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function dimension remains a basis for distinguishing among different research 
Problems. 

We indicated in chapter 1 that in the evolution of psychology the distinction 
between structural and functional approaches became correlated with and 
eventually confused with another distinction, that between the two psychological 
languages of cognitive psychology and behaviorism. The cognitivist prefers to 
Summarize the organization of the organism's environment and its behavior in terms 
Of structures that the organism knows, for which the language of mind and the 
mental is convenient. The behaviorist prefers to anchor accounts of action to the 
details of functional relations among observable events, to which the language of 
Stimuli and responses is appropriate. Yet the relations between stimuli and 
Tesponses are implicit in the cognitive vocabulary, because these relations are 
Summarized in the correspondence between environmental structure and behavioral 
Structure that is intrinsic in structural assumptions. Similarly, structure is implicit 
7 the behavioral vocabulary, because it is summarized in the Properties that 
functionally define the response classes called operants and discriminated operants. 

Biology did not have an equivalent problem: the debate over mechanism versus 
Vitalism, perhaps analogous to that betwee n and mentalism, remained 


n behaviorisi h 
1 ny and physiology: Hein, 


dsonably i istinction between anator 
y independent of the distinc chology, the respective 
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correlations of an নি! positions with structural and functional 
Problems made it difficult to recognize that the problems were different and ক" 
Ore that these research areas might be complementary rather than mutualy 
exclusivy, €. 

When the biologists of an earlie 
Ins were not so much with whet 


© distingyi b i 
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Whether one or the other problem should be given priority. The arguments were 
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between learning and performance, but this distinction merely served asa a 
asserting the primacy of structure over function. Yet what is structure a 
situation? The performance is a particular sequence of turns, and is structure 
this sense. To say that a structure was learned, however, is not to say that the 
structure was a cause of learning. A theorist who wanted to reinstate the primacy 
of function over structure might then argue that the structure is learned because of 
the contingencies: particular turns at Particular choice-points led to particular new 
locations in the maze, and the rat learned these functional relations even without 
food in the maze. Simply in negotiating the environment, the consequences of 
behavior matter. Then comes the argument that if contingencies define what is 
learned, they too cannot be a Cause of learning. And so it goes. The theorist arguing 
for the primacy of structure was probably a cognitivist, and the theorist arguing for 


the primacy of function was Probably a behaviorist, and their different languages 
were not likely to help matters. 


Yet these structures and fun 
learning. The Problems can be resol 
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combinati Y 

EE oT EET for convenience. This may not be a cheering prospect; 

te east be freed from the vain search for the undiscovered and 
able essence of the term species. (Pp. 485) 
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could Es of evolution, the relations among populations of organisms 

males and er adequately in terms of similar topographies (for example, 

Some pairs of es within some species differ more from each other in form than 
organisms within completely unrelated species; the social insects in 
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account of টী achievement, in fact, was more description than explanation. His 
evolution ciao did not depend on any theory Specifying the mechanics of 
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Skeptica] ab e properties of evolution (Gould, 1975, P- 824). Darwin himself was 
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behavior. We sometimes even offer an account of the Origins of responses as an 
explanation of behavioral phenomena (e.g., as when behavioral contrast is analyzed 
as a combination of operant and elicited responses: chapter 8). But we were not 
satisfied; we did not recognize how much had been accomplished. 

In the analysis of behavior, we deal with populations of responses. These 
populations are sometimes not well defined. The problem is not different from 
Darwin’s. Darwin clearly recognized the arbitrary nature of the concept of species, 
but unambiguous definitions of Species were no more critical to his account than 
unambiguous definitions of stimulus classes or Iesponse classes need be to a 
behavioral account. When we distinguish among words by the circumstances in 
which they are uttered (e.g., “Fire” as a mand, a tact, an echoic, or a textual 
response: chapter 10), we are simply distinguishing among classes of verbal 
responses on the basis of their origins. Ambiguous cases will necessarily occur 
because, just as Organisms have many ancestors, responses have many origins. But 
there would be little need for an analysis if that were not the Case. 

One conclusion to Which this account leads is that behavior is best 
represented not by all-or-none Categories but rather by various dimensions along 
Which processes can be located. Reinforcement and punishment are extremes on a 


range from those that increase responding, 
Sponding, to those that decrease responding. 
t are extremes on a continuum of relations 
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least adheres closely to what is done and what is observed in experiments on 
behavior. Whatever else is incorporated into any future psychology of learning, 
these operations and processes must provide its foundations. 
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Conditional reflex, 32, 40, 42, 45, 134, 199- 
215 
Conditional response (see CR) 
Conditional stimulus (see CS) 
Conditioned aversive stimulus, 61 
Conditioned emotional reaction, 206 টঃ 
Conditioned emotional response (CER), 215 
Conditioned reflex (see Conditional refed) 
Conditioned reinforcement, 61, 186, 190- 
192 
Conditioned suppression, 215, 354 
Conditioning, 31-32, 63, 75, 199-221, 226, 
360 
backward, 203-204 
and consequences, 205-208 
defensive, 206-207 
delay, 202-203 
differential, 203-204 
second-order, 208-210 
semantic, 248 
simultaneous, 202-203 210 
not stimulus substitution, 204, 210, 
220 
temporal, 203-204 
trace, 202-203 
Conjoint schedule, 187 
Conjunctive schedule, 185-187 
Consciousness, 167, 316 
motor theory of, 234 
Consequences, 24, 29-37, 56-57, 60-115, 
117,271 
in conditioning, 205-208 
natural, 78, 246-247 
in paradigm, 222-223 
and stimulus control, 32-37 244 
and verbal behavior, 235, 237, ll 
278 ces) 
Consequential operation (see Consequen 
Conservation, 347 
Constituents, 256, 258-262, 267 athe Of 
Constraints on learning (see Limits 
learning) 
Contiguity, 63, 177-180, 282, 355 
and conditioning, 204-205, 214, 219 
Contingency, 63, 82, 334, 360 
in conditioning, 204-205, 214 
defined, 82, 222, 312 
and extinction, 82-83, 112-113 
and instructions, 247 
mnemonic for, 312 
paradigm, 221-226 
in problem-solving, 350-351 
in Stimulus-control, 142, 222-223 
three-term, 22, 32-33, 165 
Continuity-discontinuit » 287 
Contrast, behavioral, 141 186-190, 196 
360 
and autoshaping, 190, 214 
Copying, not transcription, 233 
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Correction procedure, 158 
Correlation: 
in conditioning, 204-205, 215, 219-220 
in schedules, 177-180 
CR, 201-203, 208-215 
Creativity, 330 
Critical features, 27-28, 156 
Critical period in imprinting, 59 
CS, 201-204, 208-215 
Cue-dependency, 308, 331-333 
Cumulative record, 70-73, 80, 81, 85 
VI and VR, 169-170, 172-174 
Curiosity, 89 
CVC-trigram, 282, 287 


Defense reaction, 110 
Delay: 
changeover, 193 
conditioning, 202-203 
of reinforcement, 105, 178-179, 181, 
195 
Demand characteristics, 278 
Deprivation, 60-61, 77-78, 98, 199 
and generalization, 147 
sensory, 25 
Descartes, R., 40, 42 
Description, 15, 75, 91, 232, 329, 359 
Determinism, 13 
Development, 357, 359 
language, 268, 274-275 
stages of, 346-347 
Dictation (taking), 234-235, 236, 301 
Differential reinforcement, 118-127, 136- 
137; 155 
of IRTs, 175-177 
of latency, 127-129 
of pauses, 179fn 
schedules of, 129-131 
in stimulus control, 141, 143 
of successive approximations (see Shap- 
in 
Dien el orosnentorale schedules 
(see DRH, DRL, DRO, DRP) 
Differentiation, 118-127, 139, 142-143, 
277, 360 
and metamemory, 324 
vocabulary, 155 
Dimensional control, 144-145, 147-149, 
151 
Discriminated operant, 139-143, 155-160, 
163-165, 358 
Discrimination, 32-37, 133, 139, 142-165, 
226, 360 
conditional, 240, 293 
errorless, 154 
and experience, 150-151 
feature-positive, 153 
and metamemory, 324 
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and multiple schedule, 186-190 
reversal, 303 
simultaneous, 35, 292-293 
Successive, 35 
and transfer, 303 
verbal (see Verbal discrimination) 
vocabulary, 155 
Discriminative stimulus, 32, 141 
and problem-solving, 350-351 
punishment as, 102-103 
reinforcer as, 83 
Distinctiveness, 145, 290-291 
Distractor items, 296 
Dog, 30-32, 43, 4648, 57, 121, 200-212 
Double alternation, 132, 137 
DRH schedule, 130, 184, 218 
Drinking, 19, 49, 77, 99, 160 
in autoshaping, 214 
DRL schedule, 130-131, 143, 167, 175- 
176, 183-185, 217-219 
DRO schedule, 131, 185 
DRP schedule, 131, 185, 218 
Duckling, 59-60, 151 


Ebbinghaus, H., 323 
Echoic behavior, 231-232, 236, 240, 242, 
247, 301 
Echoic memory, 316 
Effect, law of, 64, 73, 91, 96 
Electrical stimulation of brain, 121-122 
Electric shock, 68, 99, 101-102 
and conditioning, 206-207 
inescapable and unavoidable, 164, 217 
response-produced and response-inde- 
pendent, 100 
Electroconvulsive shock, 322, 333 
Elicitation, 24, 28, 38-61 
and conditioning, 199-215 
and contrast, 190 
and escape, 106-107 
and punishment, 100-102 
and reinforcement, 76, 83-84 
and stimulus control, 31-32, 140 
Emission, 33-36, 39, 46, 200 
Emotion, 52, 83, 199-200, 206, 215-217 
anger, 216 
fear, 206, 209-210, 215-216 
joy, 217 
language of, 239 
Encoding, 308, 327, 332 
acoustic, 327 
and depth of processing, 329, 335 
by elaboration, 327-328 
and imagery, 341 
semantic, 328-330 
by substitution, 327-328 
Enhancement, conditioned, 217 
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Escape, 57, 93, 105-108, 353 
and conditioning, 207 
and elicited responding, 106-107 
extinction, 112 
Ethics, 97, 98, 115 
Ethology, 28 
Etymology, 6, 240 
Event record, 44, 72 
Evolution, 110, 112, 357-361 
Excitation, 149, 152, 189 
Exercise, 39, 55-59, 353 
Explanation, 11, 22, 75, 148-149, 216, 340, 
359 
Exploratory behavior, 89 
Extinction, 75, 79-85, 95, 98, 184 
in conditioning, 209 
and gradients, 147 
and inhibition, 79-82, 153 
in multiple schedule, 140-141 
and negative reinforcement, 112-113 
and recovery from punishment, 95, 98 
and superstition, 83-85 
after VI and VR, 172-173 
Eye-blink, 44 
Eye movement, 21, 64, 90-91 
and attention, 339-340 


Facilitation, 52-55, 59 
Fading, 153-155, 165 
and errorless discrimination, 154 


and verbal discrimination, 294-295 
Fatigue, 81 


Feedback, 278 

Feeder, 68-69 

FI schedule, 144, 

218, 247 

Fish, tropical, 55 

Fixation, 303 

Fixed action pattern, 28 

Fixed-interval schedule (see FI schedule) 

Fixed-ratio schedule (see FR schedule) 

Fixed-time schedule (see FT schedule) 

Forgetting, 283, 307, 308, 320-323 
instructed, 324-325 

Formal classes, verbal, 231-236, 301 

Freedom, 13, 196 

Free-operant behavior, 70 

Free recall, 280-281, 289-292, 296-297 324 
clustering, 297, 331 fl 
modified, 289 

Frequency, relative (see Probability) 

Frog, 53 

FR schedule, 8, 180-183, 184 

Frustration, 81, 188 

FT schedule, 180, 183, 184-185 


Function, 11-12, 23, 227, 337, 356- 
of memory, 335 8; 


in stimulus control, 165 
of verbal behavior, 230-25 3, 256, 258 


167, 180-183, 184-185, 


Index 


Function fixity, 350 
Fuzzy set, 159, 271 


Galvanic skin response, 52, 248 
Generalization, 139, 142-143, 144, 147- 
153, 165 
and deprivation, 147 
and schedule, 147 
in verbal behavior, 241 
vocabulary, 155 
Generalization gradient, 147-149, 151 
Genetics, 359 
Gerbil, 101 
Glandular secretion, 47, 120 55 
Gradient of stimulus control, 146-153, 155, 
170 
and deprivation, 147 
difference, 149 
excitatory, 149, 152 
generalization, 147-149, 151 
inhibitory, 149, 151-153 
post-discrimination, 149-151, 153 
and schedule, 147 
Grain, of cumulative record, 71-72 
Grammar, 137, 230, 255-268, 275 
Gull, 26-29, 56-57 


Habit, 17 
Habituation, 52-55 
Hallucination, 25, 338 
Hampton Court maze, 66 
Heart rate, 121-122 
Heat, as reinforcer, 107-108, 225, 354 
Helplessness, leamed, 164 
Herring gull (see Gull) 
Hierarchical organization in language, 
267, 356 
semantic, 271-272, 335 
Hierarchy: 
response, 18-21, 61, 78, 99, 117, hd 
habit-family, 18 
Higher-order schedule, 183 
Hobbits-and-orcs problem, 348-349 
Houselight, 68, 69, 181 
Hume, David, 282 
Hypotheses, 133 


266- 


Imagery and imagining, 338-342, 354 
eidetic, 341 
in mnemonics, 328 

Imitation, 26, 164, 231 

Ent, 39, 59-60 

ncubation, 329 

Induction, 119-127, 139, 142-143, 155 

Infant, 21-25, 43-44, 55-58, 91, 206 
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Inhibition, 17, 151 
in conditioning, 211-212 
in contrast, 189 
in extinction, 79-82, 153 
proactive, 302-303, 319 (see also Inter- 
ference) 
reflex, 46, 59 
retroactive, 302-303, 323 
Inhibitory gradient, 149, 151-153 
Innate behavior, 13, 26-28, 110 (see also 
Species-specific behavior) 
Insight, 26, 29, 64, 337 
Instinct, 39, 57 (see also Species-specific 
behavior) 
Instructions, 245-247, 250 
to forget, 324-325 
and memory, 331 
and verbal learning, 278, 284, 289, 291, 
297 
Intelligence, 26, 29, 64, 229 
Interaction, operant-respondent, 215-221 
Interference, 139, 277, 283 
and extinction, 81 
and memory, 308, 321-323, 331 
Proactive, 302-303, 319-320, 322-323, 
324, 333 
retroactive, 302-303, 323 
Interlocking schedule, 185-187 
Interresponse time (see IRT) 
Interval schedule (see FI schedule, VI 
schedule) 
Intestinal motility, 121-122, 214 
Intraverbal behavior, 242-244, 247, 253, 
257-258, 277 
Introspection, 338 
IRT, differential reinforcement of, 130-131, 
3 143, 185 
in VI and VR, 175-177 


Jumping stand (Lashley), 33-34 


Kemet, 264, 266 
Cy, pigeon, 8, 69, 117-119 
ltten, 89 
nL Jerk (see Reflex, patellar) 
oEEEE 4, 12, 337,357 
“Of results, 278 
Kohler, w., 64, 351 


Language, 134-135, 229-275 
nal SEait 268, 274-275 
vers - 
steno ক 273-274 
Lat ilerentiation, 127-129 
ent learning, 86-88, 226, 353, 357- 
358 
Laughing Eull (see Gull) 
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Laws (see Effect, Exercise, Matching, Re- 
flex) 
Leamed helplessness, 164 
Learning: 
all-or-none, 65, 85, 287 
intentional vs. incidental, 278 
latent, 86-88, 226, 353, 357-358 
observational, 164 
place- vs. response-, 146 
sensory-motor, 88-91 
set, 163-165, 299 
verbal (see Paired-associates learning, 
Recall, Recognition, Serial learn- 
ing, Verbal discrimination) 
Lexicon, 229, 255 
LH, 177, 184, 218 
Light, as aversive stimulus, 107 
Limited hold (see LH) 
Limits of learning, 135-136, 355 
and language, 273 
Linguistics, 229 
Link, 10, 195 
Listening, 244, 246, 247-250 
LSD, 99 


Maintenance of behavior, 8, 196 
Mand, 245, 248, 251 
Masochism, 102 


Massed practice, 283 
Matching, in concurrent schedules, 192- 
194 


Matching law, 193 
Matching-to-sample, 156-159, 248 
symbolic, 158 
Maximizing, Lat 
omentary, 1 
faze, 31, 64-68, 86-88, 131-133, 139, 146, 
287, 357-358 Se 
T-maze, 6667, 132, 146, 
Meaning, 135, 230, 248-249, 255, 263, 268- 
273, 335 oT 
componential analysis of, 
semantic differential, 270 
and transfer, 301 
in verbal learning, 282, 283, 286 
Mechanism, 357 
Mediation, language, 328 
rizing, 313 
Merery 289, 307-335, 354, 359-360 
consolidation, 322, 329 
decay vs. interference, 318-320 


echoic, 316 

episodic, 308, TE 

iconic, 313, 324, 

interference, 308, 321-323, 331 5 
long-term, 308, 313, 317, 320-323, 
335 


primary, 316 


’ 
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Memory (cont.) 


running, ৰ G43 
in tic, » 334-: 
Sh, 308, 313, 316-320, 324, 335 
span of immediate, 282, 314, 316, 320, 
327, 328, 345 
trace, 321, 330, 359 
working, 324 
Memory drum, 279 
Mentalism, 11-12, 357 
Mental representation, 346-347 
enactive, iconic, and symbolic, 346 
Mental rotation, 340-341 
Metabolism, conditioning of, 206 
Metamemory, 324-325, 335 
Metaphor, 139, 240-241, 275, 347 
search, 333,343 -345 
Storage and retrieval, 325-335 
Mill, James, 282 
Mind, 10, 12, 307, 357 
Mixed schedule, 187 
Mnemonic, 307, 309, 310-313, 325, 335, 
339 
method of loci, 31 1, 328, 331 
Peg-word technique, 312 
Molar theory, 112, 194 
Molecular theory, 112, 194 
Mongolian gerbil, 101 
Monkey, 49, 51, 53, 89, 101-102, 113, 163- 
165, 217, 304 
Monochromatic environment, 151 
Morse code, 235 
Motivation, 39, 60-61, 76, 226 
Motor memory, 334 
Motor skill, 119, 132, 283, 299, 346 
discrete vs, continuous, 334 
Motor theory Of thinking, 134 
Movement, 23-24, 65, 1 18, 339, 354 
Multiple schedule, 186-187, 195 
and stimulus control, 140-141 
Music, 134-135 
Myna bird, 232fn 


Naming, 241 
not tacting, 236 
Negation, 251 
Negative punishment, 113 (see also Punish- 
ment) 
Negative reinforcement, 104, 353 (see also 
Avoidance, Escape) 
ambiguity of, 107-108 
parameters, 105 
Vocabulary, 104 
Nervous system, 5, 40, 52 
Nonsense syllable, 243, 264, 270, 282 
Not-responding, as behavior, 1 14-115, 131 
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Novelty, 137 
and recognition, 296 
in semantics, 290 | 
in verbal behavior, 244, 252, 258, 274 
275 


j 1 
Observation, 18, 21, 24-26, 35, 226, 35 
Occasion (for response), 33, 36, 140, 165 
Oddity, 158 
Omission training, 113, 207-208, 213, 226, 

353 ED 
Operant, 32, 70, 273, 354 (see also Discrim: 
inated operant) 
as class, 357-359 
differentation of, 119-123 
and respondent, 199-200 
Operant level, 82 
Operation, 24-37, 222-223, 353-354 EEE 
also Consequences, Observa I 
Presentation of stimulus, Stimu 
control) 
Overlearning reversal effect, 303 
Overshadowing, 210-211 


Paired-associates learning, 243, 277, 280 
281, 285-289, 297 
and transfer, 299-301 
Paradigm, 63, 221-227 
Paramecium, 4-5 
Parameter, 105, 196, 217 
Paraphrase, 251, 263-264 115 
Parent, 23, 26-29, 58-59, 75, 94, 102, 115, 
231, 241, 247 
Parrot, 232fn 
Pause: 
Post-reinforcement, 8, 182 
in VR, 171 
Pavlov, LP., 42, 47, 80, 120 sai CHG) 
Pavlovian conditioning (see Conditioning, 
Peak shift, 149, 151 
Perception, 7 
Perceptual readiness or set, 339 
Performance, 358 
ETOUp average vs. individual, 67 
Phoneme, 232, 243 
Phrase Structure, 258-262 7 
Physiology, 5-6, 12fn, 40, 327, 356-35 118- 
Pigeon, 8-10, 49-51, 102-103, 110, 
119, 127-129 
and autoshaping, 213-214 
chamber, 68-70 
memory, 310 
and schedules, 171-197 
and stimulus control, 144-154, 156-160 
and tacting, 237-240, 252 
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Place-learning, 146 
Plurality, 251, 260, 273, 274-275 
Poetry, 232 
Polydipsio, 49 
Population (see Class) 
Porpoise, 137 
Positive punishment (see Punishment) 
Positive reinforcement (see Reinforcement) 
Post-discrimination gradient, 149-151, 153 
Post-reinforcement pause, 8, 182 
Posture, 24, 40, 49, 118, 119, 207 
Potentiation, 39, 52-55 
Practice, 55-56 
massed vs. spaced, 283 
Pragmatics, 255-256 
Preconditioning, sensory, 208-210, 226 
Predictability, in conditioning, 204-205, 
214,219 
Preference, 13, 20, 196 
Preparedness, 110, 160-162, 355 (see also 
Species-specific behavior) 
Presentation of stimulus, 24, 26-29, 35-37, 
222-223 (see also Elicitation) 
and stimulus control, 31-32, 199-215 
Primacy, 290, 291 
Primate, 25-26 
Prism, inverting, 89-90 
Private events: 
as cognitive processes, 338, 341, 346 
tacts of, 241-242 
Proaction, 277, 302-303 
and memory, 308, 319, 323,324, 333 
Probability, 42445, 111, 117, 125, 199, 227 
conditional, 44-45 
in conditioning, 205, 219 
of reinforcement, 172, 175-176 
Probe, 245 
Problem-box (Thorndike), 30, 64-65 
Problem-solving, 25-26, 29, 253, 330, 338, 
346, 348-351, 360 
Process, 24-35, 75 
Processing, cognitive, 338-347 
depth or level, 329 
stages, 342-345 
Production, verbal, 245, 268 
Prompt, 243, 245 
Pseudoconditioning, 205fn 
Psychological reality, 261, 265-266 
'sychology: 
behavioral, 11-12, 40, 42, 268, 338, 
340, 357-360 
cognitive, 11-12, 338-347, 357-360 
gestalt, 330 
Punisher, 74, 223 (see also Punishment) 
Punishing stimulus (see Punisher) 
Punishment, 24, 37, 64fn, 75, 93-104, 120, 
226, 337, 355, 360 
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and conditioning, 206 

and discriminative stimuli, 102-103 

effectiveness, 97 

and elicitation, 100-102 

negative, 113, 207-208 

permanence, 97 

and preparedness, 160 

and reinforcement, 94-96 

relativity, 99 

schedules, 185, 192 

vocabulary, 74fn, 94, 114 
Puzzle-box (Thorndike), 30, 64-65 


Quintilian, 311 


Raccoon, 136 
Rat, 18-21, 33-35, 48-49, 82-83, 119-124, 
140-143, 215-221, 304 
escape and avoidance, 57, 105-112 
maze-learning, 65-70, 86-87, 133, 146 
preparedness, 160-164 
response hierarchy, 18-21, 60, 77-78, 99- 
100 
Rate, response, 68, 197 
differentiation of, 131 
Ratio schedule (see FR, VR) 
Reaching, 33 
Reaction time, 340, 342 
Reading, 11-12, 156, 244, 247-250, 295, 
301, 356, 357 
not textual behavior, 234 
Recall, 117, 227, 309, 317, 360 
cued, 277, 331 
free, 280-281 (see also Free recall) 
serial, 291 
Recency, 290, 291 
Recognition, 277, 280, 282, 297, 360 
nonverbal, 295-296 
and recall, 321 
verbal, 295-297 
Recovery: 
after punishment, 95-96 
spontaneous, 80-82, 323 
Recursive structure, 266 
Redundancy, 283 
Reference, 229-230, 249-250, 255 
Reflex, 22, 39-48 
alimentary, 31 
Babinski, 43-44 
conditional, 32, 40, 42, 45 
laws of, 41 
patellar, 40, 205 
pupillary, 40, 206 
salivary, 199, 205, 207 
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Reflex (cont.) 
startle, 39-40, 51-53, 205fn 
walking, 57-59 
Reflex arc, 40 
Rehearsal: 
and iconic memory, 314-315 
and memory, 313, 324, 335, 344 
and short-term memory, 316-320 
in verbal learning, 291, 310 
Reinforcement, 8, 32, 37, 62-91, 93, 226, 
350, 360 
and conditioning, 202 
continuous, 167, 184 
differential, 118-127 (see also Differen- 
tial reinforcement) 
intermittent, 167 
negative, 93, 104-112 
parameters, 105 
partial, 167, 202 
permanence (see Extinction) 
and punishment, 94-95 
regular, 167 
relativity of (see Relativity) 
of verbal behavior, 244-245 
vocabulary, 73-76, 104 
Reinforcement schedules (see Schedules) 
Reinforcer, 74, 223, 226 (see also Re- 
inforcement) 
conditioned, 61 
Reinforcing stimulus (see Reinforcer) 
Relation, discrimination of, 156-159, 239 
Relativity: 
in elicitation, 201-202 
of punishment, 99 
of reinforcement, 64, 76-78, 88, 136, 
190, 225-226, 355 
Releaser, 28 
Remembering (see Memory) 
Reminiscence, 307, 322 
Repetition, 55-57 
Resistance to change, 79, 197 
Resistance to extinction, 79-80 
Respondent, 199-200, 358-359 
Respondent conditioning (see Conditioning) 
Response, 17, 22-23, 35, 39, 357 
aggressive, 82-83 
conditional (see CR) 
observing, 52, 157 
orienting, 52, 200-201 
unconditional (see UR) 
Response availability, 286-287 
Response hierarchy (see Hierarchy) 
Response integration, 286-287, 299, 303 
Response-learning, 65, 88, 146 
Response-shock (RS) interval, 108-109, 112 
Response strength, 131, 197 
Retention, 325, 329-330 
Retrieval, 308, 325-326, 329, 330-333 
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Retroaction, 277, 302-303, 308, 323 

Reversal, 303-304 

Reversibility of reinforcement (see Rela- 
tivity) 

Rhesus monkey, 217 

RS interval, 108-109, 112 

Running wheel, 19, 49, 77, 99 

Runway, 31, 67-68, 287 


Saccharin, 160-161 
St. Augustine, 330 
Salience, 143, 149, 210, 291 
Salivation, 31, 4041, 57, 121, 199, 200, 
205 
operant, 207-208 
Spontaneous, 121, 200 
Satiation, 118, 199 
semantic, 271 
Savings, method of, 285, 320, 321 
Scallop, 72, 180-181, 186, 247 
inverted, 185 
Schedules, 129-131, 167-197 
classified, 183-187 
and gradients, 147 
Search: 
exhaustive vs. self-terminating, 343-344 
memory, 333, 342-345 
random vs. fixed-order, 343, 344 
serial vs. parallel, 343-345 
visual, 292, 339 
Second-order schedule, 183 
Selection (sce Evolution) 
Self-control, 196 
Self-embedded Structure, 267 
Self-injury, 102, 115 
Semantic differential, 270 
Semantics, 250, 255, 263, 268-275, 337, 
359-360 
and memory, 308, 319-320, 327 
and syntax, 269 
and verbal learning, 292-294 
Sensitization, 53fn 
sensory-motor learning, 64, 88-91, 226, 353 
Sensory Preconditioning, 208-210, 226 
Sensory processes, 88-91, 338-342 
Sensory reinforcement, 89 
Sentence, 255, 270 
ambiguous, 258, 260, 268 ial 
Sequence, verbal (see Intraverbal, Seria 
learning) 2 
Sequential behavior, 10, 18, 42, 85-86, 13 
and acquisition, 164-165 5 
chains vs. temporally extended units, 
133-135 6- 
Sequential dependencies in language, 25 
258, 267 
Serial anticipation, 280, 283 
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Serial learning, 243, 277, 279-280, 282- 
286, 288, 297 
Serial-position effect, 283-284, 289 
Serial recall, 279-283 
Shaping, 34, 117-119, 125-127, 213, 360 
and fading, 154 
impossibility in conditioning, 200 
and verbal behavior, 232, 246 
of visualizing, 341 
vocabulary, 155 
Shift: 
extradimensional vs. intradimensional, 
304 
peak, 149, 151 
reversal vs. nonreversal, 304 
Shock (see Electric shock) 
Shock-shock (SS) interval, 108-109, 112 
Side effects: 
of extinction, 83 
of punishment, 99-103 
Signal, 24, 31-32, 36-37, 199, 354 
warning, 57, 111, 220 
Signalled avoidance, 57 
two-process theory, 110-112 
Sign language, 231, 248 
Sign-tracking, 214 
Similarity, 282, 283, 300-301, 305 
Simile, 240 
Simonides, 311 
Skinner, B. F., 241 
Sleep, 25, 283, 321, 329 
Slope of cumulative record, 70-71 
Spaced practice, 283 
Span of immediate memory (see Memory) 
Speaking, 229, 231-234, 246 
Species, 358-360 
Species peclic behavior, 13, 26-28, 70, 
101, 110, 136, 206, 354-359 
Species-specific defense reactions, 110, 206 
Spirit, 353 
Spontaneous behavior, 56-57, 121 
Spontaneous recovery, 80-82 
S-R bonds, 134, 135 
SS interval, 108-109, 112 
State-dependency, 308, 333 
Steady-state, 164, 196 
Stereotypy of responses, 84 
Stimulus, 17, 22-23, 35, 39, 78, 156, 222, 
357 
appetitive, 20, 223, 226 
aversive, 20, 61, 93, 114, 206, 223, 226 
comparison, 157-158 
conditional (see CS) 
contingent, 221-224 
discriminative (see Discriminative stimu- 
lus) 
functional, 278, 283 
neutral, 20, 88, 201-203, 224, 226 
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nominal, 278 
preaversive, 216-221 
sample, 156-158 
supernormal, 27 
unconditional (see US) 
warning, 57, 111, 220 
Stimulus control, 24, 222-223, 337, 351, 
354, 356 (see also Discriminated 
operant, Discrimination, Gradi- 
ent, Multiple schedule) 
and attention, 143-145, 153, 154 
in conditioning, 199-215 
and consequences, 32-37 
and elicitation, 31-32, 35-37 
and fading, 153-155 
in tacting, 236-242, 256 
and verbal learning, 286, 289 
Stimulus-learning, 65, 88 
Stimulus-presentation operation (see Presen- 
tation of stimulus) 
Stimulus-response bonds, 134, 135 
Storage, 308, 325-329 
Straight alley, 31, 67-68, 287 
Strain, ratio, 10, 173 
Strength, reflex, 41-42 
Strength, response, 131, 197 
Structure, 11-12, 23, 28, 137, 183, 337, 
354, 356-358 
deep vs. surface, 263 
hierarchical, 266-267, 271-272, 346-347 
of lists, 289 
and memory, 308, 327, 335 
phrase, 258-262 
and problem-solving, 348 
of verbal behavior, 230, 243, 255-273 
Subvocal responses, 233, 234 
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